Abstract

We study the approximation hardness of the Shortest Superstring, the
Maximal Compression and the Maximum Asymmetric Traveling Salesperson
(MAX-ATSP) problem. We introduce a new reduction method that produces
strongly restricted instances of the Shortest Superstring problem, in which
the maximal orbit size is eight (with no character appearing more than eight
times) and all given strings have length at most four. Based on this reduction
method, we are able to improve the best previously known approximation
lower bound for the Shortest Superstring problem and the Maximal Compression
problem by an order of magnitude. The results also imply an improved
approximation lower bound for the MAX-ATSP problem.



1 Introduction 

In the Shortest Superstring problem, we are given a finite set S of strings and
we would like to construct their shortest superstring, which is the shortest possible
string such that every string in S is a proper substring of it.

The task of computing a shortest common superstring appears in a wide variety
of applications related to computational biology (see, e.g., [L88] and [L90]).
Intuitively, short superstrings preserve important biological structure and are good
models of the original DNA sequence. In the context of computational biology, DNA
sequencing is the important task of determining the sequence of nucleotides in a
molecule of DNA. The DNA can be seen as a double-stranded sequence of four
types of nucleotides represented by the alphabet {a,c,g,t}. Identifying those
strings for different molecules is an important step towards understanding the
biological functions of the molecules. However, with current laboratory methods,
it is not possible to extract a long molecule directly as a whole. In fact, biochemists
split millions of identical molecules into pieces each typically containing
at most 500 nucleotides. Then, from sometimes millions of these fragments, one
has to compute the superstring representing the whole molecule.

From the computational point of view, the Shortest Superstring problem is an
optimization problem, which consists of finding a minimum length superstring for
a given set S of strings over a finite alphabet $\Sigma$. The underlying decision version
was proved to be NP-complete [MS77]. However, there are many applications
that involve relatively simple classes of strings. Motivated by those applications,
many authors have investigated whether the Shortest Superstring problem becomes
polynomial time solvable under various restrictions to the set of instances.
Gallant et al. [GMS80] proved that this problem in the exact setting is still
NP-complete for strings of length three and polynomial time solvable for strings of
length two. On the other hand, Timkovskii [T90] studied the Shortest Superstring
problem under restrictions to the orbit size of the letters in $\Sigma$. The orbit size of a
letter is the number of its occurrences in the strings of S. Timkovskii proved that
this problem restricted to instances with maximal orbit size two is polynomial time
solvable. He raised the question about the status of the problem with maximal orbit
size k for any constant $k \geq 3$. It is known that the Shortest Superstring problem
remains NP-hard for strongly restricted instances such as the following:

(i) all strings have length four and the maximal orbit size is six [M94],

(ii) the size of the alphabet of the instance is exactly two [GMS80], and

(iii) all strings are of the form $10^p10^q$, where $p, q \in \mathbb{N}$ [M98].

In order to cope with the exact computation intractability, approximation algorithms
were designed to deal with this problem. The first polynomial time
approximation algorithm with a constant approximation ratio was given by Blum et
al. [BJL+94]. It achieves an approximation ratio of 3. This factor was improved in
a series of papers yielding approximation ratios of 2.88 by Teng and Yao [TY93],
2.83 by Czumaj et al. [CGP+94], 2.79 by Kosaraju, Park, and Stein [KPS94], 2.75
by Armen and Stein [AS95], 2.67 by Armen and Stein [AS98], 2.596 by Breslauer,
Jiang, and Jiang [BJJ97] and 2.5 by Sweedyk [S99]. The currently best known
approximation algorithm is due to Mucha [M12] and yields an approximation ratio
of 2.478.

On the lower bound side, Blum et al. [BJL+94] proved that approximating the
Shortest Superstring problem is APX-hard. However, the constructed reduction
produces instances with arbitrarily large alphabets. In [O99], Ott provided the
first explicit approximation hardness result and proved that the problem is
APX-hard even if the size of the alphabet is two. In fact, Ott proved that instances
over a binary alphabet are NP-hard to approximate with an approximation ratio
$17246/17245\ (1.000057) - \epsilon$ for every $\epsilon > 0$. In 2005, Vassilevska [V05] gave an
improved approximation lower bound of 1217/1216 (1.00082) by using a natural
construction. The constructed instances of the Shortest Superstring problem have
maximal orbit size 20 and the length of the strings is exactly 4.

In this paper, we prove that even instances of the Shortest Superstring
problem with maximal orbit size 8 and all strings having length 4 are NP-hard to
approximate within any factor less than 333/332 (1.00301).

Maximal Compression problem. We are given a collection of strings
$S = \{s_1, \ldots, s_n\}$. The task is to find a superstring for S with maximum
compression, which is the difference between the sum of the lengths of the given
strings and the length of the superstring.

In the exact setting, an optimal solution to the Shortest Superstring problem
is an optimal solution to this problem, but the approximate solutions can differ
significantly in the sense of approximation ratio. The Maximal Compression problem
arises in various data compression problems (cf. [SS82], [S88] and [MJ75]).
The decision version of this problem is NP-complete [MS77]. Tarhio and Ukkonen
[TU88] and Turner [T89] gave approximation algorithms with approximation
ratio 2. The best known approximation upper bound is 1.5 [KLS+05], obtained by
reducing the problem to the MAX-ATSP problem, which is defined below.

On the approximation lower bound side, Blum et al. [BJL+94] proved the
APX-hardness of the Maximal Compression problem. The first explicit approximation
lower bounds were given by Ott [O99], who proved that it is NP-hard to
approximate this problem with an approximation factor $11217/11216\ (1.000089) - \epsilon$
for every $\epsilon > 0$. This hardness result was improved by Vassilevska [V05], implying
a lower bound of $1072/1071\ (1.00093) - \epsilon$ for any $\epsilon > 0$, unless P = NP. In this
paper, we prove that approximating the Maximal Compression problem with an
approximation ratio less than 204/203 (1.00492) is NP-hard.

Maximum Asymmetric Traveling Salesperson (MAX-ATSP) problem. We are given
a complete directed graph G and a weight function w assigning each
edge of G a nonnegative weight. The task is to find a closed tour of maximum
weight visiting every vertex of G exactly once.

This problem has various applications; in fact, a good approximation algorithm
for MAX-ATSP yields a good approximation algorithm for many other
optimization problems such as the Shortest Superstring problem, the Maximal
Compression problem and the Minimum Asymmetric (1,2)-Traveling Salesperson
(MIN-(1,2)-ATSP) problem. The latter problem is the restricted version of the Minimum
Asymmetric Traveling Salesperson problem, in which we restrict the weight
function w to weights one and two. The MAX-ATSP problem can be seen as a
generalization of the MIN-(1,2)-ATSP problem in the sense that any $(1/\alpha)$-approximation
algorithm for the former problem transforms into a $(2 - \alpha)$-approximation
algorithm for the latter problem. Due to this reduction, all negative results concerning
the approximation of the MIN-(1,2)-ATSP problem imply hardness results for the
MAX-ATSP problem. Since MIN-(1,2)-ATSP is APX-hard [PY93], there is little
hope for polynomial time approximation algorithms with arbitrarily good precision
for the MAX-ATSP problem. On the other hand, the first approximation algorithm
for the MAX-ATSP problem with guaranteed approximation performance is due to
Fisher, Nemhauser, and Wolsey [FNW79] and achieves an approximation factor
of 2. After that, Kosaraju, Park, and Stein [KPS94] gave an approximation algorithm
for that problem with performance ratio 1.66. This result was improved by
Bläser [B02], who obtained an approximation upper bound of 1.63. Lewenstein
and Sviridenko [LS03] were able to improve the approximation upper bound for
that problem to 1.60. Then, Kaplan et al. [KLS+05] designed an algorithm for the
MAX-ATSP problem yielding the best known approximation upper bound of 1.50.

On the approximation hardness side, Engebretsen [E99] proved that, for any
$\epsilon > 0$, there is no $(2805/2804 - \epsilon)$-approximation algorithm for MIN-(1,2)-ATSP,
unless P = NP, which yields an approximation lower bound of $2804/2803\ (1.00035) - \epsilon$
for the MAX-ATSP problem. The negative result was improved by Engebretsen
and Karpinski [EK06] to $321/320\ (1.0031) - \epsilon$ for the MIN-(1,2)-ATSP problem. It
implies the best known approximation lower bound of $320/319\ (1.0031) - \epsilon$, unless
P = NP. In this paper, we prove that approximating the MAX-ATSP problem with
an approximation ratio less than 204/203 (1.00492) is NP-hard.

2 Preliminaries 

In the following, we introduce some notation and abbreviations. 

Throughout, for $i \in \mathbb{N}$, we use the abbreviation $[i]$ for the set $\{1, \ldots, i\}$.

Given a finite alphabet $\Sigma$, a string is an element of $\Sigma^*$. Given two strings
$v = v_1 \cdots v_n$ and $w = w_1 \cdots w_m$ over $\Sigma$, we denote the length of $v$ by $|v|$. Furthermore,
$v$ is a substring of $w$ if $m \geq n$ and there exists a $j \in \{0, \ldots, m - n\}$ such that for all
$i \in [n]$, $v_i = w_{j+i}$. $w$ is said to be a superstring of $v$ if $v$ is a substring of $w$. Given
a set of strings $S = \{s_1, \ldots, s_n\} \subseteq \Sigma^*$, a string $s \in \Sigma^*$ is a superstring for $S$ if $s$ is
a superstring of every $s_i \in S$. Given a superstring $s$ for $S$, the compression of $s$,
denoted $comp(S, s)$, is defined as

$comp(S, s) = \sum_{i \in [n]} |s_i| - |s|.$

In addition, we introduce the notion of the maximal orbit size of S, which is the
maximal number of occurrences of a character in S. We are ready to give the definition
of the Shortest Superstring problem and the Maximal Compression problem.
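
As a small illustration of the maximal orbit size, the following Python sketch counts how often each character occurs over a set of strings. The function name `maximal_orbit_size` and the sample strings are illustrative assumptions and do not appear in the paper.

```python
from collections import Counter

def maximal_orbit_size(strings):
    """Largest number of occurrences of a single character over all strings."""
    counts = Counter()
    for s in strings:
        counts.update(s)  # adds one count per character occurrence
    return max(counts.values(), default=0)

# Hypothetical sample: 'a' occurs 1 + 1 + 2 = 4 times, more than any other letter.
sample = ["abcd", "bcda", "aacd"]
print(maximal_orbit_size(sample))
```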

Definition 1. Given an alphabet $\Sigma$ and a set of strings $S = \{s_1, \ldots, s_n\} \subseteq \Sigma^*$ such
that no string in S is a substring of another string in S, in the Shortest Superstring
problem we have to find a superstring s for S of minimum length, whereas in the
Maximal Compression problem, we have to find a superstring s for S with maximum
compression.
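
To make Definition 1 concrete, here is a minimal brute-force sketch in Python. It tries every ordering of the strings and merges consecutive strings with maximal overlap, which suffices when no string is a substring of another. The helper names (`overlap`, `merge`, `shortest_superstring`, `comp`) and the toy instance are assumptions made for illustration; the search is exponential and only meant for tiny inputs.

```python
from itertools import permutations

def overlap(v, w):
    """Length of the longest proper suffix of v that is a proper prefix of w."""
    for k in range(min(len(v), len(w)) - 1, 0, -1):
        if v.endswith(w[:k]):
            return k
    return 0

def merge(order):
    """Concatenate the strings in the given order using maximal overlaps."""
    s = order[0]
    for w in order[1:]:
        s += w[overlap(s, w):]
    return s

def shortest_superstring(strings):
    """Brute force over all orderings (exponential time)."""
    return min((merge(p) for p in permutations(strings)), key=len)

def comp(strings, s):
    """Compression: total length of the given strings minus the length of s."""
    return sum(len(t) for t in strings) - len(s)

S = ["abc", "bcd", "cda"]  # hypothetical instance, substring-free
s = shortest_superstring(S)
print(s, len(s), comp(S, s))
```

On this toy instance, minimizing the length and maximizing the compression pick the same superstring, as expected in the exact setting.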

In the following, we concentrate on the traveling salesperson problems. We begin
with the definition of the MAX-ATSP problem. For this reason, we introduce the
notion of a Hamiltonian tour. Given a directed graph $G = (V, A)$, a Hamiltonian
tour is a cycle in G visiting each vertex of G exactly once.

Definition 2 (MAX-ATSP). Given a complete directed graph $G = (V, A)$ and a weight
function w assigning each edge of G a nonnegative weight, the MAX-ATSP problem
consists of finding a Hamiltonian tour of maximum weight in G.

Next, we give the definition of the MIN-(1,2)-ATSP problem, which is closely
related to the MAX-ATSP problem.

Definition 3 (MIN-(1,2)-ATSP). In the MIN-(1,2)-ATSP problem, we are given a
complete directed graph $G = (V, A)$ and a weight function $w : A \to \{1, 2\}$. The task is
to find a Hamiltonian tour of minimum weight in G.

3 Related Work 

In the following, we present some results related to the problems studied in this 
paper. In particular, we describe briefly some reductions, which we use later on. 

The following theorem is due to Vassilevska [V05] and deals with the best known
approximation lower bounds for the Shortest Superstring problem as well as for
the Maximal Compression problem.

Theorem 1 ([V05]). For any $\epsilon > 0$, it is NP-hard to approximate the Shortest
Superstring problem and the Maximal Compression problem restricted to instances
with equal length strings in polynomial time within a factor of

• $1.00082 - \epsilon$ and

• $1.00093 - \epsilon$, respectively.

In addition, the maximal orbit size of the constructed instances in [V05] is 20
and all strings have length four. In the same paper, it was proved that the Shortest
Superstring problem is the hardest to approximate on instances over a binary
alphabet.

Theorem 2 ([V05]). Suppose the Shortest Superstring problem can be approximated
by a factor $\alpha$ on instances over a binary alphabet. Then, the Shortest Superstring
problem can be approximated by a factor $\alpha$ on instances over any alphabet.

Given an instance S of the Shortest Superstring problem, consider the associated
weighted complete graph, in which the vertices are represented by the strings in
S and the weight of an edge is given by the maximum number of overlapping
letters of the corresponding strings. Then, the optimal compression is equal
to the weight of a maximum Hamiltonian path. By introducing a special vertex
representing the start and the end of the Hamiltonian cycle, the Maximal Compression
problem is equivalent to the MAX-ATSP problem on this graph. This fact
was used in [KLS+05] in order to obtain an improved approximation algorithm for
the Maximal Compression problem.

Fact 1. An $\alpha$-approximation algorithm for the MAX-ATSP problem implies an
$\alpha$-approximation algorithm for the Maximal Compression problem.
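
The correspondence between compression and the overlap graph can be checked on a toy instance. The sketch below computes the maximum weight of a Hamiltonian path in the overlap graph and the maximum weight of a tour after adding a special vertex. Modelling the special vertex as the empty string (overlap 0 with everything) is an assumption made for this illustration, as are the function names and the sample strings.

```python
from itertools import permutations

def overlap(v, w):
    """Length of the longest proper suffix of v that is a proper prefix of w."""
    for k in range(min(len(v), len(w)) - 1, 0, -1):
        if v.endswith(w[:k]):
            return k
    return 0

def max_path_weight(strings):
    """Maximum total overlap along a Hamiltonian path of the overlap graph."""
    return max(
        sum(overlap(a, b) for a, b in zip(p, p[1:]))
        for p in permutations(strings)
    )

def max_tour_weight(strings):
    """MAX-ATSP value after adding a special vertex with overlap 0 to all strings."""
    best = 0
    for p in permutations(strings):
        cycle = ("",) + p  # the empty string plays the role of the special vertex
        weight = sum(
            overlap(cycle[i], cycle[(i + 1) % len(cycle)])
            for i in range(len(cycle))
        )
        best = max(best, weight)
    return best

S = ["abc", "bcd", "cda"]  # hypothetical instance
total = sum(len(t) for t in S)
print(max_path_weight(S), max_tour_weight(S), total - max_path_weight(S))
```

For this instance both values coincide, and subtracting them from the total length gives the length of a shortest superstring.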
Another interesting relation can be derived by replacing all edges with weight two
of an instance of the MIN-(1,2)-ATSP problem by edges of weight zero and then
computing a Hamiltonian tour of maximum weight. Vishwanathan [V92] proved
that this transformation relates the MIN-(1,2)-ATSP problem to the MAX-ATSP
problem in the following sense.

Theorem 3 ([V92]). A $(1/\alpha)$-approximation algorithm for the MAX-ATSP problem
implies a $(2 - \alpha)$-approximation algorithm for the MIN-(1,2)-ATSP problem.
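
The way Theorem 3 transfers lower bounds can be replayed with exact rational arithmetic. The sketch below assumes the reading "$(1/\alpha)$-approximation implies $(2 - \alpha)$-approximation": a hardness threshold $\beta = 2 - \alpha$ for MIN-(1,2)-ATSP then translates into the threshold $1/\alpha$ for MAX-ATSP. The function name is hypothetical.

```python
from fractions import Fraction

def max_atsp_threshold(min12_threshold):
    """Translate a MIN-(1,2)-ATSP hardness threshold beta = 2 - alpha into 1/alpha."""
    alpha = 2 - min12_threshold
    return 1 / alpha

# The two thresholds quoted in the introduction.
print(max_atsp_threshold(Fraction(2805, 2804)))  # 2804/2803
print(max_atsp_threshold(Fraction(321, 320)))    # 320/319
```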

Due to this reduction, every hardness result concerning the MIN-(1,2)-ATSP problem
can be transformed into a hardness result for the MAX-ATSP problem. The
best known approximation lower bound for the MIN-(1,2)-ATSP problem is proved
in [EK06] and it yields the following hardness result.

Theorem 4 ([EK06]). For any constant $\epsilon > 0$, it is NP-hard to approximate the
MIN-(1,2)-ATSP problem with an approximation ratio $1.0031 - \epsilon$.

According to Theorem 3, it implies the hardness result for the MAX-ATSP problem
stated below.

Corollary 1. For any constant $\epsilon > 0$, it is NP-hard to approximate the MAX-ATSP
problem within $1.0031 - \epsilon$.

3.1 Hybrid Problem 

In their paper on approximation hardness of bounded occurrence instances of several
combinatorial optimization problems, Berman and Karpinski [BK99] introduced
the Hybrid problem and proved that this problem is NP-hard to approximate
within some constant.

Definition 4 (Hybrid problem). Given a system of linear equations mod 2 containing
$n$ variables, $m_2$ equations with exactly two variables, and $m_3$ equations with exactly
three variables, find an assignment to the variables that satisfies as many equations
as possible.
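
A minimal sketch of the objective in Definition 4, written in Python: each equation is stored as a pair of a variable tuple and a right-hand side, and an assignment is scored by the number of equations it leaves unsatisfied. The data layout, the function name and the tiny instance are assumptions chosen for illustration.

```python
def unsatisfied(equations, assignment):
    """Count equations (variables, rhs) whose XOR of assigned values differs from rhs."""
    count = 0
    for variables, rhs in equations:
        parity = 0
        for v in variables:
            parity ^= assignment[v]
        if parity != rhs:
            count += 1
    return count

# Hypothetical instance: three equations with two variables, one with three.
equations = [
    (("x1", "x2"), 0),
    (("x2", "x3"), 0),
    (("x1", "x3"), 0),
    (("x1", "y1", "z1"), 1),
]
all_zero = {"x1": 0, "x2": 0, "x3": 0, "y1": 0, "z1": 0}
better = dict(all_zero, y1=1)
print(unsatisfied(equations, all_zero), unsatisfied(equations, better))
```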

In the aforementioned paper, Berman and Karpinski proved the following hardness
result.

Theorem 5 ([BK99]). For any constant $\epsilon > 0$, there exist instances of the Hybrid
problem with $42\nu$ variables, $60\nu$ equations with exactly two variables, and $2\nu$
equations with exactly three variables such that:

(i) Each variable occurs exactly three times.

(ii) Either there is an assignment to the variables that leaves at most $\epsilon\nu$ equations
unsatisfied, or else every assignment to the variables leaves at least $(1 - \epsilon)\nu$
equations unsatisfied.

(iii) It is NP-hard to decide which of the two cases in item (ii) above holds.
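
The parameters of Theorem 5 can be sanity-checked by counting variable occurrences. The sketch below assumes the instance sizes are $42\nu$ variables, $60\nu$ two-variable equations and $2\nu$ three-variable equations; under that reading, the total number of variable slots equals exactly three per variable, matching property (i). The function name is hypothetical.

```python
def hybrid_counts(nu):
    """Return (variables, equations, variable occurrences) for parameter nu."""
    variables = 42 * nu
    eq2, eq3 = 60 * nu, 2 * nu
    occurrences = 2 * eq2 + 3 * eq3  # slots in two- and three-variable equations
    return variables, eq2 + eq3, occurrences

variables, equations, occurrences = hybrid_counts(5)
print(variables, equations, occurrences, occurrences == 3 * variables)
```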
Analyzing the details of their construction, it can be seen that every instance of the
Hybrid problem produced by it has an even more special structure. The equations
containing three variables are of the form $x \oplus y \oplus z = b$ with $b \in \{0, 1\}$. Those
equations arise from the theorem of Håstad [H01] concerning the hardness of approximating
equations with exactly three variables, called the MAX-E3-LIN problem, which can
be seen as a special instance of the Hybrid problem.




Figure 1: An example of a Hybrid instance with circles $C^x$, $C^y$ and $C^z$, and hyperedge
$e = \{z_7, y_{21}, x_{14}\}$.

For every variable $x$ of the original instance $\mathcal{E}_3$ of the MAX-E3-LIN problem, they
introduced a corresponding set of variables $V_x$. If the variable $x$ occurs $t_x$ times in
$\mathcal{E}_3$, then $V_x$ contains $7t_x$ variables $x_1, \ldots, x_{7t_x}$. Furthermore, the variables in $V_x$ are
connected by equations of the form $x_i \oplus x_{i+1} = 0$ with $i \in [7t_x - 1]$ and $x_1 \oplus x_{7t_x} = 0$.

This construction induces the circle $C^x$ on the variables $V_x$. In addition to it,
every circle $C^x$ possesses an associated matching $M^x$. The variables contained
in $Con(V_x) = \{x_i \mid i \in \{7\nu \mid \nu \in [t_x]\}\}$ are called contact variables, whereas the
variables in $V_x \setminus Con(V_x)$ are called checker variables.
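
The index structure of a single circle can be sketched as follows, assuming the reading above: $7t_x$ variables per circle and contact variables at the indices that are multiples of 7. The function name `circle_layout` and the returned layout are illustrative assumptions; the matching $M^x$ on the checker variables is not modelled here.

```python
def circle_layout(t_x):
    """Index sets of one circle for a variable occurring t_x times."""
    n = 7 * t_x
    contact = [i for i in range(1, n + 1) if i % 7 == 0]
    checker = [i for i in range(1, n + 1) if i % 7 != 0]
    circle_eqs = [(i, i + 1) for i in range(1, n)]  # x_i + x_{i+1} = 0 (mod 2)
    border_eq = (1, n)                              # x_1 + x_n = 0 (mod 2)
    return contact, checker, circle_eqs, border_eq

contact, checker, circle_eqs, border_eq = circle_layout(2)
print(contact, len(checker), len(circle_eqs), border_eq)
```

For $t_x = 2$ this yields two contact variables and twelve checker variables; the even number of checker variables is what allows a perfect matching on them.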

Let $\mathcal{E}_3$ be an instance of the MAX-E3-LIN problem and $\mathcal{H}$ be its corresponding
instance of the Hybrid problem. We denote by $V(\mathcal{E}_3)$ the set of variables which
occur in the instance $\mathcal{E}_3$. Then, $\mathcal{H}$ can be represented graphically by $|V(\mathcal{E}_3)|$ circles
$C^x$ with $x \in V(\mathcal{E}_3)$ containing the variables $V(C^x) = \{x_1, \ldots, x_{n_x}\}$ as vertices,
where $n_x = 7t_x$ is the number of variables on the circle.
The edges are identified by the equations included in $\mathcal{H}$. The equations with
exactly three variables are represented by hyperedges $e$ with cardinality $|e| = 3$. The
equations $x_i \oplus x_{i+1} = 0$ induce a circle containing the vertices $\{x_1, \ldots, x_{n_x}\}$ and the
matching equations $x_i \oplus x_j = 0$ with $\{i, j\} \in M^x$ induce a perfect matching on the set
of checker variables. An example of an instance of the Hybrid problem is depicted
in Figure 1.

In summary, we notice that there are four types of equations in the Hybrid
problem: (i) the circle equations $x_i \oplus x_{i+1} = 0$ with $i \in [7t_x - 1]$, (ii) circle border
equations $x_1 \oplus x_{7t_x} = 0$, (iii) matching equations $x_i \oplus x_j = 0$ with $\{i, j\} \in M^x$, and (iv)
equations with three variables of the form $x \oplus y \oplus z = b$ with $b \in \{0, 1\}$.

In the remainder, we assume that equations with three variables contain
only unnegated variables due to the transformation $\bar{x} \oplus y \oplus z = 0 \equiv x \oplus y \oplus z = 1$.

4 Our Contribution 

We now formulate our main result. 

Theorem 6. Given an instance $\mathcal{H}$ of the Hybrid problem with $n$ circles, $m_2$ equations
with two variables and $m_3$ equations with exactly three variables with the properties
described in Theorem 5, we construct in polynomial time an instance $S_{\mathcal{H}}$ of the
Shortest Superstring problem and Maximal Compression problem with the following
properties:

(i) If there exists an assignment $\phi$ to the variables of $\mathcal{H}$ which leaves at most $u$
equations unsatisfied, then there exists a superstring $s_\phi$ for $S_{\mathcal{H}}$ with length
$|s_\phi| \leq 5m_2 + 16m_3 + 7n + u$.

(ii) From every superstring $s$ for $S_{\mathcal{H}}$ with length $|s| = 5m_2 + 16m_3 + u + 7n$, we can
construct in polynomial time an assignment $\psi_s$ to the variables of $\mathcal{H}$ that leaves
at most $u$ equations in $\mathcal{H}$ unsatisfied.

(iii) If there exists an assignment $\phi$ to the variables of $\mathcal{H}$ which leaves at most $u$
equations unsatisfied, then there exists a superstring $s_\phi$ for $S_{\mathcal{H}}$ with compression
$comp(S_{\mathcal{H}}, s_\phi) \geq 3m_2 + 12m_3 - u + 5n$.

(iv) From every superstring $s$ for $S_{\mathcal{H}}$ with compression $comp(S_{\mathcal{H}}, s) = 3m_2 + 12m_3 - u + 5n$,
we can construct in polynomial time an assignment $\psi_s$ to the variables
of $\mathcal{H}$ that leaves at most $u$ equations in $\mathcal{H}$ unsatisfied.

(v) The maximal orbit size of the instance $S_{\mathcal{H}}$ is 8 and the length of each string in
$S_{\mathcal{H}}$ is 4.

The preceding theorem can be used to derive an explicit approximation lower bound
for the Shortest Superstring problem by reducing instances of the Hybrid problem
of the form described in Theorem 5 to the Shortest Superstring problem.

Corollary 2. For every $\epsilon > 0$, it is NP-hard to approximate the Shortest Superstring
problem with an approximation factor $333/332\ (1.00301) - \epsilon$.
Proof. First of all, we choose constants $k \in \mathbb{N}$ and $\delta > 0$ such that

$\frac{333 - \delta}{332 + \delta + \frac{42}{k}} \geq \frac{333}{332} - \epsilon$

holds. Given an instance $\mathcal{E}_3$ of the MAX-E3-LIN problem, we generate $k$ copies
of $\mathcal{E}_3$ and produce an instance $\mathcal{H}$ of the Hybrid problem. Then, we construct the
corresponding instance $S_{\mathcal{H}}$ of the Shortest Superstring problem with the properties
described in Theorem 6. We conclude according to Theorem 5 that there exists a
superstring for $S_{\mathcal{H}}$ with length at most

$5 \cdot 60\nu k + 16 \cdot 2\nu k + \delta\nu k + 7n \leq \left(332 + \delta + \frac{7n}{k\nu}\right)\nu k \leq \left(332 + \delta + \frac{7 \cdot 6}{k}\right)\nu k,$

where the last inequality uses $n \leq 6\nu$, or the length of a superstring for $S_{\mathcal{H}}$ is
bounded from below by

$5 \cdot 60\nu k + 16 \cdot 2\nu k + (1 - \delta)\nu k + 7n \geq (332 + (1 - \delta))\nu k \geq (333 - \delta)\nu k.$

From Theorem 5, we know that the two cases above are NP-hard to distinguish.
Hence, for every $\epsilon > 0$, it is NP-hard to find a solution to the Shortest Superstring
problem with an approximation ratio $\frac{333 - \delta}{332 + \delta + \frac{42}{k}} \geq \frac{333}{332} - \epsilon$. □
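
The choice of $k$ and $\delta$ in the proof can be explored numerically. The sketch below evaluates the ratio between the lower bound of the unsatisfiable case and the upper bound of the satisfiable case, both normalized by $\nu k$ and using $n \leq 6\nu$ as in the proof. The function name and the sample values of $\delta$ and $k$ are assumptions for illustration.

```python
from fractions import Fraction

def superstring_gap(delta, k):
    """(333 - delta) / (332 + delta + 42/k): the hardness gap for given delta and k."""
    yes_upper = 332 + delta + Fraction(42, k)  # satisfiable case, per unit of nu*k
    no_lower = 333 - delta                     # unsatisfiable case, per unit of nu*k
    return no_lower / yes_upper

target = Fraction(333, 332)
for delta, k in [(Fraction(1, 10), 10), (Fraction(1, 1000), 10**6)]:
    gap = superstring_gap(delta, k)
    print(float(gap), float(target - gap))
```

As $\delta$ shrinks and $k$ grows, the gap approaches 333/332 from below, which is why the $-\epsilon$ term appears in Corollary 2.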

Analogously, Theorem 6 can be used to derive an approximation lower bound for
the Maximal Compression problem.

Corollary 3. For every $\epsilon > 0$, it is NP-hard to approximate the Maximal Compression
problem with an approximation factor $204/203\ (1.00492) - \epsilon$.

By applying Fact 1, we obtain the following hardness result for the MAX-ATSP
problem.

Corollary 4. For every $\epsilon > 0$, it is NP-hard to approximate the MAX-ATSP problem
with an approximation factor $204/203\ (1.00492) - \epsilon$.
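
The constant 204/203 can be traced with the same kind of calculation as for the superstring length, now using the compression bounds of Theorem 6 with $m_2 = 60\nu k$ and $m_3 = 2\nu k$. The sketch below is an assumed reconstruction of that argument, not taken from the paper: it drops the nonnegative $5n$ term in the satisfiable case and bounds it by $30\nu$ (from $n \leq 6\nu$) in the unsatisfiable case, which can only make the computed gap smaller.

```python
from fractions import Fraction

def compression_gap(delta, k):
    """(204 - delta) / (203 + delta + 30/k), normalized by nu*k."""
    yes_lower = 3 * 60 + 12 * 2 - delta                          # = 204 - delta
    no_upper = 3 * 60 + 12 * 2 - (1 - delta) + Fraction(30, k)   # = 203 + delta + 30/k
    return yes_lower / no_upper

target = Fraction(204, 203)
gap = compression_gap(Fraction(1, 1000), 10**6)
print(float(target), float(gap))
```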



5 Reduction from the Hybrid Problem 

In this section, we present the proof of Theorem 6. But before we describe the
approximation preserving reduction, we first prove a slightly weaker result. The
first approach uses strings of length 6 to simulate equations with exactly three
variables. In Section 5.6, we will introduce smaller gadgets for equations with
exactly three variables implying the claimed inapproximability results.

Let us state the properties of our first approach. 

Theorem 7. Given an instance $\mathcal{H}$ of the Hybrid problem with $n$ circles, $m_2$ equations
with two variables and $m_3$ equations with exactly three variables with the properties
described in Theorem 5, we construct in polynomial time an instance $S_{\mathcal{H}}$ of the
Shortest Superstring problem and Maximal Compression problem with the following
properties:

(i) If there exists an assignment $\phi$ to the variables of $\mathcal{H}$ which leaves at most $u$
equations unsatisfied, then there exists a superstring $s_\phi$ for $S_{\mathcal{H}}$ with length
$|s_\phi| \leq 5m_2 + 22m_3 + 7n + u$.

(ii) From every superstring $s$ for $S_{\mathcal{H}}$ with length $|s| = 5m_2 + 22m_3 + u + 7n$, we can
construct in polynomial time an assignment $\psi_s$ to the variables of $\mathcal{H}$ that leaves
at most $u$ equations in $\mathcal{H}$ unsatisfied.

(iii) If there exists an assignment $\phi$ to the variables of $\mathcal{H}$ which leaves at most $u$
equations unsatisfied, then there exists a superstring $s_\phi$ for $S_{\mathcal{H}}$ with compression
$comp(S_{\mathcal{H}}, s_\phi) \geq 3m_2 + 14m_3 - u + 5n$.

(iv) From every superstring $s$ for $S_{\mathcal{H}}$ with compression $comp(S_{\mathcal{H}}, s) = 3m_2 + 14m_3 - u + 5n$,
we can construct in polynomial time an assignment $\psi_s$ to the variables
of $\mathcal{H}$ that leaves at most $u$ equations in $\mathcal{H}$ unsatisfied.

(v) The maximal orbit size of the instance $S_{\mathcal{H}}$ is eight and the length of a string in
$S_{\mathcal{H}}$ is bounded by six.

Combining Theorem 5 with Theorem 7, we obtain the following explicit lower
bound for the Shortest Superstring problem.

Corollary 5. It is NP-hard to approximate the Shortest Superstring problem with
an approximation factor less than 345/344 (1.0029).

Before we proceed to the proof of Theorem 7, we describe the reduction from
a high-level view and try to build some intuition.

5.1 Main Ideas and Overview 

Given an instance $\mathcal{H}$ of the Hybrid problem, we want to transform $\mathcal{H}$ into an
instance of the Shortest Superstring problem. Fortunately, the special structure
of the linear equations in the Hybrid problem is particularly well-suited for our
reduction, since a part of the equations with two variables form a circle and every
variable occurs exactly three times. For every equation $g_{i+1} \equiv x_i \oplus x_{i+1} = 0$ included
in this circle, we introduce a set $S(g_{i+1})$ containing two strings, which can be
aligned advantageously in two natural ways. If the fragments corresponding to
two consecutive equations $x_{i-1} \oplus x_i = 0$ and $x_i \oplus x_{i+1} = 0$ use the same
natural alignment, we are able to overlap those fragments by one letter. From a
high-level view, we can construct an associated superstring for each circle in $\mathcal{H}$,
which contains the naturally aligned strings. In fact, we define for every equation
$g \in \mathcal{H}$ an associated set of strings $S(g)$ and the corresponding natural alignments.
The instance $S_{\mathcal{H}}$ of the Shortest Superstring problem is given by the union of all
sets $S(g)$. Due to the construction of the sets $S(g)$, there is a particular way to
interpret an alignment of the strings in $S(g)$ included in the resulting superstring
as an assignment to the variables in the Hybrid instance. The major challenge in
the proof of correctness is to prove that every superstring for $S_{\mathcal{H}}$ can be interpreted
as an assignment to the variables in the Hybrid instance $\mathcal{H}$ with the property that
the number of satisfied equations is connected to the length of the superstring.
5.2 Constructing $S_{\mathcal{H}}$ from $\mathcal{H}$

Given an instance $\mathcal{H}$ of the Hybrid problem, we are going to construct the
corresponding instance $S_{\mathcal{H}}$ of the Shortest Superstring problem. Furthermore, we
introduce some notations and conventions.

For every equation $g \in \mathcal{H}$, we define a set $S(g)$ of corresponding strings.
The corresponding instance $S_{\mathcal{H}}$ of the Shortest Superstring problem is given by
$S_{\mathcal{H}} = \bigcup_{g \in \mathcal{H}} S(g)$. The strings in the set $S(g)$ differ by the type of the considered
equation $g \in \mathcal{H}$. Let us start with the description of $S_{\mathcal{H}}$. To this end, we need to
specify the instance of the Hybrid problem more precisely.

Let $\mathcal{E}_3$ be an instance of the MAX-E3-LIN problem and $\mathcal{H}$ its corresponding
instance of the Hybrid problem with $n$ circles. For every variable $x \in V(\mathcal{E}_3)$, there
is an associated circle $C^x$. Each circle $C^x$ with $n_x$ variables consists of $n_x - 1$ circle
equations $g^x_{i+1}$ with $i \in [n_x - 1]$, a circle border equation $g^x_1 \equiv x_1 \oplus x_{n_x} = 0$ and $|M^x|$ matching
equations $g^x_e$ with $e \in M^x$. Furthermore, we have $m_3$ equations $g^3_j$ with exactly
three variables. We are going to specify the sets $S(g)$ differing by the type of
equation $g$. In particular, we distinguish four types of equations contained in $\mathcal{H}$.

(i) circle equations 

(ii) matching equations 

(iii) circle border equations 

(iv) equations with exactly three variables 

Let us begin with the description of the strings corresponding to circle border 
equations. 

Strings Corresponding to Circle Border Equations 

Given an instance $\mathcal{H}$ of the Hybrid problem, a circle $C^x$ in $\mathcal{H}$ and its circle border
equation $g^x_1 \equiv x_1 \oplus x_n = 0$, we introduce six associated strings, which are all included
in the set $S(g^x_1)$. Due to the construction of the circle $C^x$, the variable $x_n$ is a
contact variable. This means that $x_n$ appears in an equation $g^3_j$ with exactly three
variables. The strings in the set $S(g^x_1)$ differ by the type of the equation $g^3_j$. We begin
with the case $g^3_j \equiv x_n \oplus y \oplus z = 0$.

The string $L_x C^l_x$ is used as the initial part of the superstring corresponding
to this circle, whereas $C^r_x R_x$ is used as the end part. Furthermore, we introduce
strings that represent an assignment that sets either the variable $x_1$ to 0 or the
variable $x_n$ to 1. The corresponding two strings are

$C^l_x x_1^{m0} x_n^{l1} C^r_x$ and $x_n^{l1} C^r_x C^l_x x_1^{m0}$.

Finally, we define the last two strings of the set $S(g^x_1)$,

$C^l_x x_1^{r1} x_n^{m0} C^r_x$ and $x_n^{m0} C^r_x C^l_x x_1^{r1}$,

having a similar interpretation. Both pairs of strings can be overlapped by two
letters. Those natural alignments have a crucial influence during the process
of constructing a superstring. For this reason, we introduce a notation for these
alignments. By the 0-alignment of the strings in $S(g^x_1)$, we refer to the following
alignment of the four strings. In the following, $\downarrow$ will denote the overlapping of
the strings.



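$C^l_x x_1^{m0} x_n^{l1} C^r_x$ and $x_n^{l1} C^r_x C^l_x x_1^{m0}$
$\downarrow$
$C^l_x x_1^{m0} x_n^{l1} C^r_x C^l_x x_1^{m0}$

$C^l_x x_1^{r1} x_n^{m0} C^r_x$ and $x_n^{m0} C^r_x C^l_x x_1^{r1}$
$\downarrow$
$x_n^{m0} C^r_x C^l_x x_1^{r1} x_n^{m0} C^r_x$

On the other hand, we define the 1-alignment of the strings in $S(g^x_1)$ as
follows.

$C^l_x x_1^{m0} x_n^{l1} C^r_x$ and $x_n^{l1} C^r_x C^l_x x_1^{m0}$
$\downarrow$
$x_n^{l1} C^r_x C^l_x x_1^{m0} x_n^{l1} C^r_x$

$C^l_x x_1^{r1} x_n^{m0} C^r_x$ and $x_n^{m0} C^r_x C^l_x x_1^{r1}$
$\downarrow$
$C^l_x x_1^{r1} x_n^{m0} C^r_x C^l_x x_1^{r1}$

Both ways to join the four strings are called simple alignments.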

After having described how the strings corresponding to $S(g^x_1)$ are defined in the case
$g^3_j \equiv x_n \oplus y \oplus z = 0$, we are going to deal with the case $g^3_j \equiv x_n \oplus y \oplus z = 1$.
As before, we use $L_x C^l_x$ as the initial part of the superstring corresponding to
this circle, whereas $C^r_x R_x$ is used as the end part. Furthermore, we define the
remaining four strings contained in $S(g^x_1)$ as follows.

$C^l_x x_1^{m0} x_n^{m1} C^r_x$ and $x_n^{m1} C^r_x C^l_x x_1^{m0}$

$C^l_x x_1^{r1} x_n^{l0} C^r_x$ and $x_n^{l0} C^r_x C^l_x x_1^{r1}$

Both pairs of strings can be overlapped by two letters. We introduce a notation
for these alignments.

$C^l_x x_1^{m0} x_n^{m1} C^r_x$ and $x_n^{m1} C^r_x C^l_x x_1^{m0}$
$\downarrow$
$C^l_x x_1^{m0} x_n^{m1} C^r_x C^l_x x_1^{m0}$

$C^l_x x_1^{r1} x_n^{l0} C^r_x$ and $x_n^{l0} C^r_x C^l_x x_1^{r1}$
$\downarrow$
$x_n^{l0} C^r_x C^l_x x_1^{r1} x_n^{l0} C^r_x$

The alignment introduced above is called the 0-alignment of the strings in $S(g^x_1)$.
On the other hand, we define the 1-alignment of the strings in $S(g^x_1)$ as
follows.

$C^l_x x_1^{m0} x_n^{m1} C^r_x$ and $x_n^{m1} C^r_x C^l_x x_1^{m0}$
$\downarrow$
$x_n^{m1} C^r_x C^l_x x_1^{m0} x_n^{m1} C^r_x$

$C^l_x x_1^{r1} x_n^{l0} C^r_x$ and $x_n^{l0} C^r_x C^l_x x_1^{r1}$
$\downarrow$
$C^l_x x_1^{r1} x_n^{l0} C^r_x C^l_x x_1^{r1}$

In the remainder, we refer to both ways to overlap the four strings as simple
alignments. Next, we describe the strings corresponding to matching equations.

Strings Corresponding to Matching Equations 

Let $C^x$ be a circle in $\mathcal{H}$ and $M^x$ its associated perfect matching. Let $\{i, j\}$ be an
edge in $M^x$ and $g^x_{\{i,j\}} \equiv x_i \oplus x_j = 0$ the associated matching equation. We now define
the corresponding set $S(g^x_{\{i,j\}})$ consisting of two strings assuming $i < j$. Then, we
introduce two strings of the form

$x_j^{r0} x_j^{l0} x_i^{r1} x_i^{l1}$ and $x_i^{r1} x_i^{l1} x_j^{r0} x_j^{l0}$

corresponding to the matching equation. There are two ways to align those two
strings to obtain an overlap of two letters. In the remainder, we refer to those
alignments

$x_j^{r0} x_j^{l0} x_i^{r1} x_i^{l1} x_j^{r0} x_j^{l0}$ and $x_i^{r1} x_i^{l1} x_j^{r0} x_j^{l0} x_i^{r1} x_i^{l1}$

as simple. The first way to overlap the strings is called the 0-alignment, whereas the second
one is called the 1-alignment. Next, we describe the strings corresponding to circle
equations.
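
The two simple alignments of a pair of strings can be illustrated with a short Python sketch. Letters are modelled as list items so that multi-character symbols count as one letter each. The letters `a`, `b`, `c`, `d` are hypothetical stand-ins for the actual letters of the construction, and the function name is an assumption.

```python
def merge_with_overlap(first, second, k=2):
    """Append `second` to `first` so that they share exactly k letters."""
    if first[-k:] != second[:k]:
        raise ValueError("the strings do not overlap by k letters")
    return first + second[k:]

# Two strings of length four of the shape "uv" and "vu" with |u| = |v| = 2.
s1 = ["a", "b", "c", "d"]
s2 = ["c", "d", "a", "b"]

print(merge_with_overlap(s1, s2))  # s1 first, s2 second
print(merge_with_overlap(s2, s1))  # s2 first, s1 second
```

Both merged strings have length six instead of eight, so either alignment saves two letters; the two alignments differ only in which pair of letters ends up at the boundary.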



Strings Corresponding to Circle Equations 

Let $C^x$ be a circle in $\mathcal{H}$ and $M^x$ its associated matching. Furthermore, let $\{i, j\}$ and
$\{i + 1, j'\}$ both be contained in $M^x$. We assume that $i < j$. Then, we introduce the
corresponding strings for $x_i \oplus x_{i+1} = 0$. If $i + 1 < j'$, we have

$x_i^{l1} x_{i+1}^{r1} x_i^{m0} x_{i+1}^{m0}$ and $x_i^{m0} x_{i+1}^{m0} x_i^{l1} x_{i+1}^{r1}$.

Otherwise ($i + 1 > j'$), we have

$x_i^{l1} x_{i+1}^{m1} x_i^{m0} x_{i+1}^{r0}$ and $x_i^{m0} x_{i+1}^{r0} x_i^{l1} x_{i+1}^{m1}$.

In case of $i > j$ and $i + 1 > j'$, we use

$x_i^{m1} x_{i+1}^{m1} x_i^{l0} x_{i+1}^{r0}$ and $x_i^{l0} x_{i+1}^{r0} x_i^{m1} x_{i+1}^{m1}$.

Finally, if $i > j$ and $i + 1 < j'$, we introduce

$x_i^{m1} x_{i+1}^{r1} x_i^{l0} x_{i+1}^{m0}$ and $x_i^{l0} x_{i+1}^{m0} x_i^{m1} x_{i+1}^{r1}$.

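Let $x_i$ be a variable in $C^x$ contained in an equation $g^3_j$ with three variables. We now
define the corresponding strings for the equations $x_{i-1} \oplus x_i = 0$ and $x_i \oplus x_{i+1} = 0$.

We assume that $\{i - 1, j\}$ and $\{i + 1, j'\}$ are both included in $M^x$. Furthermore, we
assume $i - 1 < j$ and $i + 1 < j'$. If the equation $g^3_j$ is of the form $x_i \oplus y \oplus z = 0$, we
introduce

$x_{i-1}^{l1} x_i^{r1} x_{i-1}^{m0} x_i^{m0}$ and $x_{i-1}^{m0} x_i^{m0} x_{i-1}^{l1} x_i^{r1}$

for $x_{i-1} \oplus x_i = 0$. Furthermore, for $x_i \oplus x_{i+1} = 0$, we use the strings

$x_i^{l1} x_{i+1}^{r1} x_i^{m0} x_{i+1}^{m0}$ and $x_i^{m0} x_{i+1}^{m0} x_i^{l1} x_{i+1}^{r1}$.

On the other hand, if the equation $g^3_j$ is of the form $x_i \oplus y \oplus z = 1$, we introduce

$x_{i-1}^{l1} x_i^{m1} x_{i-1}^{m0} x_i^{r0}$ and $x_{i-1}^{m0} x_i^{r0} x_{i-1}^{l1} x_i^{m1}$

corresponding to the equation $x_{i-1} \oplus x_i = 0$. For $x_i \oplus x_{i+1} = 0$, we use the strings

$x_i^{m1} x_{i+1}^{r1} x_i^{l0} x_{i+1}^{m0}$ and $x_i^{l0} x_{i+1}^{m0} x_i^{m1} x_{i+1}^{r1}$.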

Accordingly, we introduce the notion of simple alignments for the strings in
$S(g^x_{i+1})$. For the strings

ml ml rO 10 „_,J rO 10 ml ml 

•^i '^i+l'^i -^j+l aiiu Xj Xj^j^Xj Xj^j^, 

we define the following alignments as simple. 

[Figure: the two simple alignments of these strings, each obtained by overlapping the pair by two letters.]


The former alignment is called the 1-alignment and the latter one is called the
0-alignment. Next, we describe the strings corresponding to equations with three
variables.
Strings Corresponding to Equations with Three Variables

We now concentrate on equations with exactly three variables. Let $g^3_j$ be an equation with three variables in $\mathcal{H}$. For every equation $g^3_j$, we define two corresponding sets $S^A(g^3_j)$ and $S^B(g^3_j)$, both containing exactly three strings. Finally, the set $S(g^3_j)$ is defined by the union $S^A(g^3_j) \cup S^B(g^3_j)$. We distinguish whether $g^3_j$ is of the form $x \oplus y \oplus z = 0$ or $x \oplus y \oplus z = 1$. The description starts with the former case.

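An equation of the form $x \oplus y \oplus z = 0$ is represented by $S^A(g^3_j)$ containing
the strings

$x^{r1}A^1_jx^{l1}y^{r1}A^2_jy^{l1}$, $y^{r1}A^2_jy^{l1}x^{m0}A^3_jC_j$, $x^{m0}A^3_jC_jx^{r1}A^1_jx^{l1}$

and by $S^B(g^3_j)$ containing the strings

$x^{r1}B^1_jx^{l1}z^{r1}B^2_jz^{l1}$, $z^{r1}B^2_jz^{l1}C_jB^3_jx^{m0}$, $C_jB^3_jx^{m0}x^{r1}B^1_jx^{l1}$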

On the other hand, for equations of the form g^ = x®y®z = 1, we introduce S^(gj) 
containing the following strings. 

y'-^A^jy^^x^'^A^Cj x"^^ A]C jx"-^ A^^ 
Furthermore, we give the definition of S^{gj) including the following strings. 

^r0^1^«0^r0^2^/0 z''^ B] Z^^ C j B^ x""^ C j B^^ x'^^ x""^ B] X^^ 

The strings in the set S^{gj) can be aligned in a cyclic fashion in order to obtain 
different strings which we will use in our reduction. Every specific alignment pos- 
sesses its own abbreviation given below. 
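
The cyclic alignments can be illustrated with a short sketch. It is only an illustration under stated assumptions: the three strings of $S^A(g^3_j)$ are taken to be $x^{r1}A^1_jx^{l1}y^{r1}A^2_jy^{l1}$, $y^{r1}A^2_jy^{l1}x^{m0}A^3_jC_j$ and $x^{m0}A^3_jC_jx^{r1}A^1_jx^{l1}$, letters are modelled as plain Python strings, and the helper names `merge` and `cyclic_alignments` are hypothetical.

```python
def merge(a, b):
    """Append b to a, sharing the longest suffix of a that is a prefix of b."""
    for k in range(min(len(a), len(b)), 0, -1):
        if a[-k:] == b[:k]:
            return a + b[k:]
    return a + b


def cyclic_alignments(strings):
    """Merge the strings in cyclic order, once for every possible starting string."""
    results = []
    for r in range(len(strings)):
        rotation = strings[r:] + strings[:r]
        merged = rotation[0]
        for t in rotation[1:]:
            merged = merge(merged, t)
        results.append(merged)
    return results


# Assumed strings of S^A for an equation x + y + z = 0 (letters as plain labels).
s1 = ("x^r1", "A1", "x^l1", "y^r1", "A2", "y^l1")
s2 = ("y^r1", "A2", "y^l1", "x^m0", "A3", "C")
s3 = ("x^m0", "A3", "C", "x^r1", "A1", "x^l1")

alignments = cyclic_alignments([s1, s2, s3])
```

Under these assumptions every rotation yields a string of twelve letters that starts and ends with the same block of three letters, which is the block used as its abbreviation.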



$x^{r1}A^1_jx^{l1}y^{r1}A^2_jy^{l1}x^{m0}A^3_jC_jx^{r1}A^1_jx^{l1} = x^{r1}A_jx^{l1}$, called $x^1$-alignment

$y^{r1}A^2_jy^{l1}x^{m0}A^3_jC_jx^{r1}A^1_jx^{l1}y^{r1}A^2_jy^{l1} = y^{r1}A_jy^{l1}$, called $y^1$-alignment

$x^{m0}A^3_jC_jx^{r1}A^1_jx^{l1}y^{r1}A^2_jy^{l1}x^{m0}A^3_jC_j = x^{m0}A_jC_j$, called left-alignment

Analogously, the strings in $S^B(g^3_j)$ can also be aligned in a cyclic fashion. We are
going to define the abbreviations for these alignments.

$C_jB^3_jx^{m0}x^{r1}B^1_jx^{l1}z^{r1}B^2_jz^{l1}C_jB^3_jx^{m0} = C_jB_jx^{m0}$, called right-$x^0$-alignment

$x^{r1}B^1_jx^{l1}z^{r1}B^2_jz^{l1}C_jB^3_jx^{m0}x^{r1}B^1_jx^{l1} = x^{r1}B_jx^{l1}$, called $x^1$-alignment

$z^{r1}B^2_jz^{l1}C_jB^3_jx^{m0}x^{r1}B^1_jx^{l1}z^{r1}B^2_jz^{l1} = z^{r1}B_jz^{l1}$, called $z^1$-alignment


The strings in $S^A(g^3_j)$ and $S^B(g^3_j)$ can be overlapped in a special way that corresponds to assigning the value 0 to $x$. It yields the string

$x^{m0}A^3_jC_jx^{r1}A^1_jx^{l1}y^{r1}A^2_jy^{l1}x^{m0}A^3_jC_jB^3_jx^{m0}x^{r1}B^1_jx^{l1}z^{r1}B^2_jz^{l1}C_jB^3_jx^{m0}$

In the remainder, we call this alignment the $x^0$-alignment of $S(g^3_j)$ and use the
abbreviation $x^{m0}C_jx^{m0}$ for this string.
5.3 Constructing the Superstring from $\phi$

Given an assignment $\phi$ to the variables of $\mathcal{H}$, we are going to construct the associated superstring $s_\phi$ for the instance $S_{\mathcal{H}}$.

For every $g \in \mathcal{H}$, we formulate rules for aligning the corresponding strings in $S(g)$ according to the assignment $\phi$. We start with sets corresponding to circle border equations and circle equations. Afterwards, we show how the current fragments can be overlapped with strings from the sets corresponding to matching equations and equations with three variables. Furthermore, we analyze the relation between the assignment $\phi$ and the length of the obtained superstring $s_\phi$. We begin with the description of the alignment of strings corresponding to circle border equations in $\mathcal{H}$.



Aligning Strings Corresponding to Circle Border Equations 

Let $C^x$ be a circle in $\mathcal{H}$ and $x_1 \oplus x_n = 0$ its circle border equation. Furthermore,

we assume that $x_n$ is contained in an equation with three variables of the form
Xn ® y ® z = 0. First, we set the string L^C^ as the initial part of our superstring 
corresponding to the circle C^. Then, we use the (/)(a;i)-alignment of the strings 

^x-^l -^n '^X7 -^n^x^x-^l i ^x-^l -^n ^xi '^^^^ -^n ^x^x-^l ■ 

In this condition, one of the strings si can be overlapped from the left side with 
LjjC^ by one letter. The other string Sr will be joined from the right side with 
C^Rx by one letter. This construction will help us to check whether assigns the 
same value to the variable Xn as to xi. The string s,. can be interpreted as the 
0(xi)-alignment of the strings corresponding to x„ e Xn+i = 0, since the first letter 
of Sr is either or x^l . 

The parts corresponding to a circle border equation with Xn ® y ® z = 1 can 
be constructed analogously. Next, we are going to align strings corresponding to 
circle equations. 



Aligning Strings Corresponding to Circle Equations 

Let $x_i \oplus x_{i+1} = 0$ be a circle equation contained in $\mathcal{H}$, and consider its two corresponding strings. Depending on the given assignment $\phi$, we use simple alignments to overlap the considered strings. More precisely, we make use of the $\phi(x_{i+1})$-alignment. For every pair of associated strings, we derive an overlap of two letters. We are going to align those fragments with strings corresponding to matching equations and equations with three variables.
Aligning Strings Corresponding to Matching Equations

Let $x_i \oplus x_j = 0$ be a matching equation in $\mathcal{H}$. Let us assume that $i < j$. We define the alignment of the strings in $S(g^x_{\{i,j\}})$ according to the value of $\phi(x_{i+1})$. More precisely, we use the $\phi(x_{i+1})$-alignment of the strings

$x_j^{r0}x_j^{l0}x_i^{r1}x_i^{l1}$ and $x_i^{r1}x_i^{l1}x_j^{r0}x_j^{l0}$.

Due to this alignment, we obtain an overlap of two letters. We are going to analyze the length of the resulting superstring depending on the assignment to the variables $x_i$, $x_{i+1}$, $x_j$ and $x_{j+1}$. We start with the case $\phi(x_{i+1}) = \phi(x_{j+1}) = 1$.

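Case $\phi(x_{i+1}) = \phi(x_{j+1}) = 1$:

We use the 1-alignment of the strings $x_j^{r0}x_j^{l0}x_i^{r1}x_i^{l1}$ and $x_i^{r1}x_i^{l1}x_j^{r0}x_j^{l0}$. The situation is depicted below. (The two triangle notations $\triangleright\triangleright$ and $\triangleleft\triangleleft$ will be explained hereafter.)

[Figure: the aligned strings and their neighbouring parts for the case $\phi(x_{i+1}) = \phi(x_{j+1}) = 1$.]
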
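The current superstring $s$ is denoted by the following sequence

$s = b\,\triangleright\triangleright\,\boxed{X_i}\;x_i^{r1}x_i^{l1}x_j^{r0}x_j^{l0}x_i^{r1}x_i^{l1}\;\boxed{x_i^{l1}}\,\triangleleft\triangleleft\;m\,\triangleright\triangleright\,\boxed{X_j}\;\boxed{x_j^{m1}}\,\triangleleft\triangleleft\;e$
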



The part $\triangleright\triangleright\,\boxed{X_i}$ represents a simple alignment of the strings corresponding to $x_{i-1} \oplus x_i = 0$ ending with the letter $X_i \in \{x_i^{r1}, x_i^{m0}\}$. The letter in the box emphasizes the letter which can be used to overlap from the right side with other strings. Furthermore, the string $\boxed{x_i^{l1}}\,\triangleleft\triangleleft$ denotes the 1-alignment of the strings corresponding to $x_i \oplus x_{i+1} = 0$, which starts with the letter $x_i^{l1}$. Analogously, $\triangleright\triangleright\,\boxed{X_j}$ is a simple alignment of the strings corresponding to $x_{j-1} \oplus x_j = 0$, where $X_j \in \{x_j^{m1}, x_j^{r0}\}$. Furthermore, we use $\boxed{x_j^{m1}}\,\triangleleft\triangleleft$ to denote the 1-alignment of the strings corresponding to $x_j \oplus x_{j+1} = 0$. Finally, $b$, $m$ and $e$ are sequences of letters, which we do not specify in detail. They define the remaining parts of the superstring $s$.

If $X_i = x_i^{r1}$ holds, we align $\triangleright\triangleright\,\boxed{X_i}$ with $x_i^{r1}x_i^{l1}x_j^{r0}x_j^{l0}x_i^{r1}x_i^{l1}$ to achieve an additional overlap of one letter. An analogous situation holds for $\triangleright\triangleright\,\boxed{X_j}$ and $\boxed{x_j^{m1}}\,\triangleleft\triangleleft$. All in all, we obtain an overlap of three letters if $\phi(x_i) = \phi(x_{i+1}) = 1$ and $\phi(x_{j+1}) = \phi(x_j) = 1$ hold. Otherwise, we lose an overlap of one letter per unsatisfied equation.

Case $\phi(x_{i+1}) = \phi(x_{j+1}) = 0$:

We use the 0-alignment of the strings $x_i^{r1}x_i^{l1}x_j^{r0}x_j^{l0}$ and $x_j^{r0}x_j^{l0}x_i^{r1}x_i^{l1}$.

[Figure: the aligned strings and their neighbouring parts for the case $\phi(x_{i+1}) = \phi(x_{j+1}) = 0$.]


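In this case, we use $\boxed{x_i^{m0}}\,\triangleleft\triangleleft$ as an abbreviation for the 0-alignment of the strings corresponding to $x_i \oplus x_{i+1} = 0$ and $\boxed{x_j^{l0}}\,\triangleleft\triangleleft$ for the 0-alignment of the strings corresponding to $x_j \oplus x_{j+1} = 0$.

If $X_i = x_i^{m0}$ holds, we align $\triangleright\triangleright\,\boxed{X_i}$ with $\boxed{x_i^{m0}}\,\triangleleft\triangleleft$ and gain an additional overlap of one letter. An analogous situation holds for $\triangleright\triangleright\,\boxed{X_j}$ and $\boxed{x_j^{l0}}\,\triangleleft\triangleleft$. Hence, we obtain an overlap of three letters if $\phi(x_{i+1}) = \phi(x_i) = 0$ and $\phi(x_{j+1}) = \phi(x_j) = 0$ hold. If the corresponding equation with two variables is not satisfied, we lose an overlap of one letter.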



Case $\phi(x_{i+1}) \neq \phi(x_{j+1}) = 1$:

In this case, we use the 0-alignment of the strings $x_i^{r1}x_i^{l1}x_j^{r0}x_j^{l0}$ and $x_j^{r0}x_j^{l0}x_i^{r1}x_i^{l1}$.

[Figure: the superstring before and after attaching the aligned strings.]

We attach $x_j^{r0}x_j^{l0}x_i^{r1}x_i^{l1}x_j^{r0}x_j^{l0}$ at the end of our current solution without having any overlap with the superstring obtained so far. Notice that we obtain in each case an additional overlap of one letter if the corresponding equation with two variables is satisfied, i.e. $X_i = x_i^{m0}$ and $X_j = x_j^{m1}$.
Case $\phi(x_{i+1}) \neq \phi(x_{j+1}) = 0$:

According to $\phi$, we use the 1-alignment of the strings $x_i^{r1}x_i^{l1}x_j^{r0}x_j^{l0}$ and $x_j^{r0}x_j^{l0}x_i^{r1}x_i^{l1}$.

[Figure: the superstring before and after joining the aligned strings.]

We join $x_i^{r1}x_i^{l1}x_j^{r0}x_j^{l0}x_i^{r1}x_i^{l1}$ from the right side with $\boxed{x_i^{l1}}\,\triangleleft\triangleleft$ and obtain an overlap of one letter. This reduces the length of the superstring by one letter independent of the assignment $\phi(x_i)$. In case of $X_i = x_i^{r1}$, we achieve another overlap of one letter, since we are able to align $\triangleright\triangleright\,\boxed{X_i}$ from the right side with $x_i^{r1}x_i^{l1}x_j^{r0}x_j^{l0}x_i^{r1}x_i^{l1}$.

It corresponds to the satisfied equation $x_i \oplus x_{i+1} = 0$. Hence, we obtain at least the same number of overlapped letters as satisfied equations.



In summary, we note that we are able to achieve an overlap of at least one letter in each case if the corresponding equation is satisfied by $\phi$. Hence, we obtain an overlap of at most three letters.

The other cases concerning equations $x_i \oplus x_j = 0$ with $i > j$ can be analyzed analogously. Next, we are going to align strings corresponding to equations with three variables.



Aligning Strings Corresponding to Equations with Three Variables 

Let $g^3_j \in \mathcal{H}$ be an equation with three variables $x$, $y$ and $z$. Furthermore, let $x_{i-1} \oplus x = 0$, $x \oplus x_{i+1} = 0$, $y_{j-1} \oplus y = 0$, $y \oplus y_{j+1} = 0$, $z_{k-1} \oplus z = 0$ and $z \oplus z_{k+1} = 0$ be the equations with two variables, in which the variables $x$, $y$ and $z$ occur. Given the assignment $\phi$ to $x$, $y$ and $z$, we are going to define the alignment of the corresponding strings. Let us start with equations of the form $g^3_j \equiv x \oplus y \oplus z = 0$. Then, we define the rule for aligning strings in $S^A(g^3_j)$ and $S^B(g^3_j)$ as follows, whereby we handle the cases $\phi(x_{i+1}) + \phi(y_{j+1}) + \phi(z_{k+1}) \in \{3,2,1,0\}$ separately, starting with $\phi(x_{i+1}) + \phi(y_{j+1}) + \phi(z_{k+1}) = 3$.

Case $\phi(x_{i+1}) + \phi(y_{j+1}) + \phi(z_{k+1}) = 3$:

In this case, we align the strings in $S(g^3_j)$ in such a way that we obtain the previously introduced strings $y^{r1}A_jy^{l1}$ and $z^{r1}B_jz^{l1}$. The situation, which we want to analyze, is depicted below.
[Figure: the aligned strings and their neighbouring parts for the case $\phi(x_{i+1}) + \phi(y_{j+1}) + \phi(z_{k+1}) = 3$.]


Similarly to the situations that we discussed concerning matching equations, we define the current superstring $s$ in the way described below.

$s = b\,\triangleright\triangleright\,\boxed{X}\;\boxed{x^{l1}}\,\triangleleft\triangleleft\;m_1\,\triangleright\triangleright\,\boxed{Y}\;y^{r1}A_jy^{l1}\;\boxed{y^{l1}}\,\triangleleft\triangleleft\;m_2\,\triangleright\triangleright\,\boxed{Z}\;z^{r1}B_jz^{l1}\;\boxed{z^{l1}}\,\triangleleft\triangleleft\;e$

Here, $b$, $m_1$, $m_2$ and $e$ denote parts of $s$, which we do not specify in detail to emphasize the parts corresponding to the equation with three variables.

The string $\boxed{x^{l1}}\,\triangleleft\triangleleft$ denotes the $\phi(x_{i+1})$-alignment of the corresponding strings in $S(g^x_{i+1})$. The strings $\boxed{y^{l1}}\,\triangleleft\triangleleft$ and $\boxed{z^{l1}}\,\triangleleft\triangleleft$ are defined analogously. In this situation, we want to analyze the cases $X \in \{x^{r1}, x^{m0}\}$, $Y \in \{y^{r1}, y^{m0}\}$ and $Z \in \{z^{r1}, z^{m0}\}$. We infer that we obtain an overlap of four letters if all equations with two variables are satisfied. Otherwise, we lose an overlap of one letter per unsatisfied equation with two variables.



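Case $\phi(x_{i+1}) + \phi(y_{j+1}) + \phi(z_{k+1}) = 2$:

Let $\alpha, \gamma \in \{x_{i+1}, y_{j+1}, z_{k+1}\}$ be variables such that $\phi(\gamma) = \phi(\alpha) = 1$ holds. Then, we use the $\alpha^1$-alignment and $\gamma^1$-alignment of the strings in $S^A(g^3_j)$ and $S^B(g^3_j)$, breaking ties arbitrarily. As an example, we display the situation for $\phi(z_{k+1}) = \phi(x_{i+1}) = 1$.

$b\,\triangleright\triangleright\,\boxed{X}\;x^{r1}A_jx^{l1}\;\boxed{x^{l1}}\,\triangleleft\triangleleft\;m_1\,\triangleright\triangleright\,\boxed{Y}\;\boxed{y^{m0}}\,\triangleleft\triangleleft\;m_2\,\triangleright\triangleright\,\boxed{Z}\;z^{r1}B_jz^{l1}\;\boxed{z^{l1}}\,\triangleleft\triangleleft\;e$

[Figure: the same sequence after overlapping the neighbouring parts.]

In this case, we achieve an overlap of five letters if all equations with two variables are satisfied. Otherwise, we lose an overlap of one letter per unsatisfied equation with two variables.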



Case $\phi(x_{i+1}) + \phi(y_{j+1}) + \phi(z_{k+1}) = 1$:

If $\phi(z_{k+1}) + \phi(x_{i+1}) = 1$ holds, we align the strings in $S^A(g^3_j)$ and $S^B(g^3_j)$ to obtain $x^{r1}A_jx^{l1}$ and $z^{r1}B_jz^{l1}$. Otherwise, we make use of the strings $x^{r1}B_jx^{l1}$ and $y^{r1}A_jy^{l1}$. We display the situation for $\phi(y_{j+1}) = 1$.

[Figure: the sequence for $\phi(y_{j+1}) = 1$, in which the string $x^{r1}B_jx^{l1}$ is attached at the end of the superstring.]

Notice that we obtain an overlap of four letters if the equations with two variables are satisfied, i.e. $X = x^{m0}$, $Z = z^{m0}$ and $Y = y^{r1}$. Otherwise, we lose an overlap of one letter per unsatisfied equation with two variables.



Case $\phi(x_{i+1}) + \phi(y_{j+1}) + \phi(z_{k+1}) = 0$:

In this case, we use the $x^0$-alignment of the strings in $S(g^3_j)$. The situation is displayed below.

[Figure: the sequence containing the string $x^{m0}C_jx^{m0}$ before and after overlapping the neighbouring parts.]

Here, we are able to achieve an overlap of five letters if all equations with two variables are satisfied, i.e. $X = x^{m0}$, $Z = z^{m0}$ and $Y = y^{m0}$.



In summary, we state that we can achieve an overlap of at least one letter independent of the assignment $\phi$. Additionally, we gain another overlap of one letter if the corresponding equation is satisfied by $\phi$.

The situation for equations of the form $x \oplus y \oplus z = 1$ can be analyzed analogously. We are going to define the assignment $\psi_s$, which is associated to a given superstring for $S_{\mathcal{H}}$.
5.4 Defining the Assignment $\psi_s$

Given a superstring $s$ for $S_{\mathcal{H}}$, we are going to define the associated assignment $\psi_s$ to the variables of $\mathcal{H}$. In order to deduce the values assigned to the variables in $\mathcal{H}$ from $s$, we have to normalize the given superstring $s$. For this reason, we define rules that transform a superstring for $S_{\mathcal{H}}$ into a normed superstring for $S_{\mathcal{H}}$ without increasing the length.

First, we introduce the definition of a normed superstring for $S_{\mathcal{H}}$.

Definition 5 (Normed Superstring $s$ for $S_{\mathcal{H}}$). Let $\mathcal{H}$ be an instance of the Hybrid problem, $S_{\mathcal{H}}$ the corresponding instance of the Shortest Superstring problem and $s$ a superstring for $S_{\mathcal{H}}$. We refer to $s$ as a normed superstring for $S_{\mathcal{H}}$ if for every $g \in \mathcal{H}$, the superstring $s$ contains $s_g$ as a proper substring, whereby $s_g$ results from a simple alignment of the strings included in $S(g)$.
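
The condition of Definition 5 can be checked directly for sets that consist of two strings. The sketch below is an illustration only and makes several assumptions that are not taken from the text: strings are tuples of letters, every set $S(g)$ is a pair whose simple alignments overlap by exactly two letters, and the function names `contains`, `simple_alignments` and `is_normed` are hypothetical. Sets belonging to equations with three variables are not covered.

```python
def contains(s, t):
    """True if the tuple t occurs as a contiguous block of the tuple s."""
    n, m = len(s), len(t)
    return any(s[k:k + m] == t for k in range(n - m + 1))


def simple_alignments(u, v, k=2):
    """Both ways to overlap u and v by exactly k letters (where possible)."""
    out = []
    if u[-k:] == v[:k]:
        out.append(u + v[k:])
    if v[-k:] == u[:k]:
        out.append(v + u[k:])
    return out


def is_normed(s, pairs):
    """Check that s contains a simple alignment of every pair in pairs."""
    return all(
        any(contains(s, a) for a in simple_alignments(u, v))
        for u, v in pairs
    )


u = ("a", "b", "c", "d")
v = ("c", "d", "a", "b")
normed = ("p",) + u + v[2:] + ("q",)   # u and v overlap by two letters
not_normed = ("p",) + u + v + ("q",)   # u and v are merely concatenated
```

In this toy instance `is_normed(normed, [(u, v)])` holds, whereas `not_normed` is still a superstring of `u` and `v` but contains neither simple alignment.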

After having defined a normed superstring, we are going to state rules which transform a superstring for $S_{\mathcal{H}}$ into a normed superstring for $S_{\mathcal{H}}$ without increasing the length of the underlying superstring. All transformations can be performed in polynomial time. Once we have generated a normed superstring, we are able to define the assignment $\psi_s$ and analyze the number of overlapped letters depending on the number of equations in $\mathcal{H}$ satisfied by $\psi_s$. Let us start with transformations of strings corresponding to circle equations and circle border equations.

Normalizing Strings Corresponding to Circle and Circle Border Equations 

Let $x_i \oplus x_{i+1} = 0$ be a circle equation in $\mathcal{H}$, and consider its two corresponding strings. We observe that these strings can have an overlap of at most one letter from the left side as well as from the right side with other strings in $S_{\mathcal{H}}$. Given a superstring $s$ for $S_{\mathcal{H}}$, we obtain at least the same number of overlapped letters if we use one of the simple alignments in $s$. In particular, we have to use the simple alignment that maximizes the number of overlapped letters.

Given a superstring $s$ for $S_{\mathcal{H}}$, we separate the strings $x_i^{m0}x_{i+1}^{r0}x_i^{l1}x_{i+1}^{m1}$ and
$x_i^{l1}x_{i+1}^{m1}x_i^{m0}x_{i+1}^{r0}$ from $s$. Consequently, this results in at most three strings

x^\mx''j^ and x[°^e such that 

„ _ Urr^O rO 11 ml II ml mO rO 

Then, we define the transformed superstring s' with at least the same number of 
overlapped letters by 

e' - U^rnO rO 11 ml mO rO ml 11 
o - •'^i+l-'^i -^i+l-^i ■^i+l^-^i+l'"'-'^i ■ 

In order to define the simple alignment, which is used in $s$ by the strings in $S(g^x_{i+1})$, we are going to state a criterion.

Let $s$ be a superstring for $S_{\mathcal{H}}$ and $g^x_{i+1}$ a circle equation. Let the corresponding strings be given by $x_i^{m0}x_{i+1}^{r0}x_i^{l1}x_{i+1}^{m1}$ and $x_i^{l1}x_{i+1}^{m1}x_i^{m0}x_{i+1}^{r0}$. Then, we say that the strings in $S(g^x_{i+1})$ use a 1-alignment in $s$ if there are more strings $s^1$ in $S_{\mathcal{H}} \setminus S(g^x_{i+1})$ such that either $s^1$ is overlapped by one letter from the right side with $x_i^{l1}x_{i+1}^{m1}x_i^{m0}x_{i+1}^{r0}$ or $s^1$ is overlapped by one letter from the left side with $x_i^{m0}x_{i+1}^{r0}x_i^{l1}x_{i+1}^{m1}$ in $s$ than strings $s^0$ in $S_{\mathcal{H}} \setminus S(g^x_{i+1})$ such that either $s^0$ is overlapped by one letter from the left side with $x_i^{l1}x_{i+1}^{m1}x_i^{m0}x_{i+1}^{r0}$ or $s^0$ is overlapped by one letter from the right side with $x_i^{m0}x_{i+1}^{r0}x_i^{l1}x_{i+1}^{m1}$ in $s$. Otherwise, the strings in $S(g^x_{i+1})$ use a 0-alignment in $s$.

Given a superstring $s$ for $S_{\mathcal{H}}$, we informally define a part of the backbone of our transformed superstring by the strings $s_g$, where $s_g$ results from the simple alignment used in $s$ by the strings in $S(g)$ for every circle equation $g \in \mathcal{H}$. Afterwards, we use this construction to align them with strings corresponding to matching equations, equations with three variables and circle border equations. Moreover, it will help us to define the assignment $\psi_s$ and relate the number of satisfied equations to the number of overlapped letters. But first, we concentrate on circle border equations.

Let $x_1 \oplus x_n = 0$ be a circle border equation. Furthermore, let the corresponding
strings be given by

IJx'^x^ ^x-'^l -^n^xi ■^n'^x^x-^l i ^x-^l -^n ^xi -^n '^x^x-^l i '^^^^ ^x^x- 

Since the simple alignments of the strings in S{gl) achieve an overlap of two 
letters for each pair {C\,x'{'^x^^Cl, x^^C^Ci^x^o} and {Cl.xl^x'^^Cl, x'^^Cj^Clxl^}, 
we argue as before that these strings can be rearranged in a given superstring for 
S-H such that the pairs use a simple alignment without increasing the length of 
the underlying superstring for Sy,. In this situation, we are able to overlap one of 
the pairs using a simple alignment with L^C^^ from the left side and the other one 
with C^Rx from the right side without increasing the length. This construction 
checks whether the variables xi and Xn have the same assigned value, which is 
rewarded by another overlap of one letter of the corresponding strings using a 
simple alignment. 

For any fixed order of the circles in $\mathcal{H}$, we build the backbone of our superstring consisting of the concatenation of the strings $s_x s_y \cdots s_z$, where the string $s_x$ is associated to its circle $C^x$. Furthermore, $s_x$ consists of the corresponding simple alignments of the strings in $S(g^x_i)$ used in $s$, and the order of the strings is given by the order of the variables in $C^x$. The string $s_x$ starts with the letter $L_x$ and ends with $R_x$.



Notice that similar transformations can be applied to strings corresponding to matching equations and to equations with three variables, but we are going to define the transformation for those strings in detail while analyzing the upper bound of overlapped letters for simply aligned strings corresponding to circle equations, which are contained in a given superstring $s$ for $S_{\mathcal{H}}$.

Before we start our analysis, we define the assignment $\psi_s$ based on the current superstring $s$ for $S_{\mathcal{H}}$, which is not necessarily a normed superstring for $S_{\mathcal{H}}$. By applying the transformations, which we are going to define, the assignment $\psi_s$ will change depending on the currently considered superstring.

$\psi_s(x_i) = 1$ if the strings in $S(g^x_i)$ use a 1-alignment in $s$, and $\psi_s(x_i) = 0$ otherwise.

Due to the transformations for the strings corresponding to circle and circle border equations, the assignment $\psi_s$ is well-defined.

Defining the Assignment for Checker Variables 

Let X € ¥{83), C be the corresponding circle in V. and its associated perfect 
matching. Furthermore, let 

and $x_i \oplus x_j = 0$ be equations in $\mathcal{H}$, where $\{i,j\} \in M^x$ and $i < j$ holds. Let $s$ be a superstring for $S_{\mathcal{H}}$ such that the strings corresponding to circle and circle border equations are using a simple alignment in $s$. Based on the simple alignments of the strings corresponding to $g^x_i$, $g^x_{i+1}$, $g^x_j$ and $g^x_{j+1}$, which are used in the superstring $s$, we are going to define the assignment to the variables $x_i$ and $x_j$. Furthermore, we analyze the number of overlapped letters that can be achieved given the simply aligned strings and relate them to the number of equations in $\mathcal{H}$ satisfied by $\psi_s$.

In the remainder, we will assume that the underlying superstring for $S_{\mathcal{H}}$ contains simply aligned strings corresponding to circle and circle border equations. Before we start our analysis, we introduce the notion of a constellation that denotes which of the simple alignments are used by the strings corresponding to the equations $g^x_i$, $g^x_{i+1}$, $g^x_j$ and $g^x_{j+1}$ in $s$.

Given a superstring $s$ for $S_{\mathcal{H}}$ and $\{i,j\} \in M^x$, a constellation $c$ is defined by $(X_iX_{i+1}, X_jX_{j+1})_{\{i,j\}}$ with $X_i, X_{i+1}, X_j, X_{j+1} \in \{0,1\}$, where $X_k = 1$ if and only if the strings in $S(g^x_k)$ use the 1-alignment in $s$ for $k \in \{i, i+1, j, j+1\}$. We call a constellation $c$ inconsistent if there is an entry $A_1A_2$ with $A_1 \neq A_2$. Otherwise, $c$ is called consistent.

Based on the given constellations, we are going to define $\psi_s$.

Definition 6 (Assignment $\psi_s$ to Checker Variables). Let $\mathcal{H}$ be an instance of the Hybrid problem, $S_{\mathcal{H}}$ its corresponding instance of the superstring problem and $s$ a superstring for $S_{\mathcal{H}}$. Given the constellation $c = (X_iX_{i+1}, X_jX_{j+1})_{\{i,j\}}$, we define $\psi_s$ in the following way.

(i) $\psi_s(x_i) = X_i$ and $\psi_s(x_j) = X_j$ if $X_i \oplus X_j = 1$ and $c$ is consistent

(ii) $\psi_s(x_i) = X_i$ and $\psi_s(x_j) = X_j$ if $X_i \oplus X_j = 0$

(iii) $\psi_s(x_i) = 1 - X_i$ and $\psi_s(x_j) = X_j$ if $X_i \oplus X_j = 1$ and $X_i \neq X_{i+1}$

(iv) $\psi_s(x_i) = X_i$ and $\psi_s(x_j) = 1 - X_j$ if $X_i \oplus X_j = 1$, $X_j \neq X_{j+1}$ and $X_i = X_{i+1}$
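
The four rules of Definition 6 translate into a small case distinction. The following sketch is meant as an illustration only: the function name `checker_assignment` and the representation of a constellation by four bits are choices made here, and it assumes that the missing right-hand side in rule (ii) is 0.

```python
def checker_assignment(X_i, X_i1, X_j, X_j1):
    """Return (psi(x_i), psi(x_j)) for the constellation (X_i X_i1, X_j X_j1)."""
    consistent = (X_i == X_i1) and (X_j == X_j1)
    if X_i ^ X_j == 0:
        return X_i, X_j            # rule (ii)
    if consistent:
        return X_i, X_j            # rule (i)
    if X_i != X_i1:
        return 1 - X_i, X_j        # rule (iii): flip the value of x_i
    return X_i, 1 - X_j            # rule (iv): flip the value of x_j


# The constellations (10, 0X), (01, 1X), (00, 10) and (11, 01).
examples = {
    (1, 0, 0, 1): checker_assignment(1, 0, 0, 1),
    (0, 1, 1, 0): checker_assignment(0, 1, 1, 0),
    (0, 0, 1, 0): checker_assignment(0, 0, 1, 0),
    (1, 1, 0, 1): checker_assignment(1, 1, 0, 1),
}
```

In the sketch, every inconsistent constellation with $X_i \oplus X_j = 1$ is mapped to an assignment with $\psi_s(x_i) = \psi_s(x_j)$.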

We are going to analyze the different constellations and discuss the cases
(i)-(iv) of the definition of $\psi_s$. We start with case (i).

CASE (i) $X_i \oplus X_j = 1$ and $c$ is consistent:

There are two constellations, which we have to analyze, namely $(11,00)_{\{i,j\}}$ and $(00,11)_{\{i,j\}}$. Starting with the former constellation, we obtain the scenario depicted below. The string $\triangleright\triangleright\,\boxed{X_i}$ represents a simple alignment of the strings in $S(g^x_i)$. Analogously, the string $\boxed{X_{i+1}}\,\triangleleft\triangleleft$ represents a simple alignment of the strings in $S(g^x_{i+1})$. Since we know that using the most profitable simple alignment of the strings in $S(g^x_{\{i,j\}})$ does not increase the length of the superstring, we make use of the 1-alignment and transform the superstring $s$ into the superstring $s'$, which are both depicted below.

[Figure: the superstrings $s$ and $s'$ for the constellation $(11,00)_{\{i,j\}}$.]


Let us derive an upper bound on the number of overlapped letters. More precisely, we are interested in the number of overlapped letters in addition to the overlap of two letters due to the simple alignment. In both cases, either by using the 1-alignment or the 0-alignment of the strings in $S(g^x_{\{i,j\}})$, we cannot obtain more than an overlap of two letters. It corresponds to the number of satisfied equations, which are $x_i \oplus x_{i+1} = 0$ and $x_j \oplus x_{j+1} = 0$.

In case of the constellation $(00,11)_{\{i,j\}}$, we separate the strings $x_i^{r1}x_i^{l1}x_j^{r0}x_j^{l0}$ and $x_j^{r0}x_j^{l0}x_i^{r1}x_i^{l1}$ from the superstring $s$. Then, we attach the aligned string $x_i^{r1}x_i^{l1}x_j^{r0}x_j^{l0}x_i^{r1}x_i^{l1}$ at the end of the current solution. The considered situation is depicted below.
[Figure: the superstring for the constellation $(00,11)_{\{i,j\}}$ before and after the transformation.]


In this scenario, the best that we are able to obtain is an overlap of two letters. This corresponds to the number of satisfied equations, namely $x_i \oplus x_{i+1} = 0$ and $x_j \oplus x_{j+1} = 0$.



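CASE (ii) $X_i \oplus X_j = 0$:

Let us start with the constellation $(0X_{i+1}, 0X_{j+1})_{\{i,j\}}$. In this case, we set $\psi_s(x_i) = 0$ and $\psi_s(x_j) = 0$. Given the strings $\triangleright\triangleright\,\boxed{x_i^{m0}}$, $\boxed{X_{i+1}}\,\triangleleft\triangleleft$, $\triangleright\triangleright\,\boxed{x_j^{r0}}$ and $\boxed{X_{j+1}}\,\triangleleft\triangleleft$ with $X_{i+1} \in \{x_i^{m0}, x_i^{l1}\}$ and $X_{j+1} \in \{x_j^{m1}, x_j^{l0}\}$, we obtain the following scenario:

[Figure: the considered scenario for the constellation $(0X_{i+1}, 0X_{j+1})_{\{i,j\}}$.]
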
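The most advantageous simple alignment in this case is the 0-alignment of the strings in $S(g^x_{\{i,j\}})$. If $\psi_s(x_i) = \psi_s(x_{i+1}) = 0$ holds, which means $X_{i+1} = x_i^{m0}$, we obtain another overlap of one letter by aligning $\triangleright\triangleright\,\boxed{x_i^{m0}}$ with $\boxed{x_i^{m0}}\,\triangleleft\triangleleft$. A similar argument holds for $\psi_s(x_j) = \psi_s(x_{j+1}) = 0$. Notice that the equation $x_i \oplus x_j = 0$ is satisfied by $\psi_s$. In summary, we state that we obtain an overlap of one additional letter per satisfied equation. Hence, we obtain an overlap of three letters according to the satisfied equations $x_i \oplus x_{i+1} = 0$, $x_i \oplus x_j = 0$ and $x_j \oplus x_{j+1} = 0$.

Consider the constellation $(1X_{i+1}, 1X_{j+1})_{\{i,j\}}$. Hence, we are given the strings $\triangleright\triangleright\,\boxed{x_i^{r1}}$, $\boxed{X_{i+1}}\,\triangleleft\triangleleft$, $\triangleright\triangleright\,\boxed{x_j^{m1}}$ and $\boxed{X_{j+1}}\,\triangleleft\triangleleft$ with $X_{i+1} \in \{x_i^{m0}, x_i^{l1}\}$ and $X_{j+1} \in \{x_j^{m1}, x_j^{l0}\}$. We obtain the scenario displayed below.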
[Figure: the considered scenario for the constellation $(1X_{i+1}, 1X_{j+1})_{\{i,j\}}$.]


In this case, we use the 1-alignment of the strings in $S(g^x_{\{i,j\}})$. If $\psi_s(x_i) = \psi_s(x_{i+1}) = 1$ holds, which means $X_{i+1} = x_i^{l1}$, we obtain another overlap of one letter by aligning $x_i^{r1}x_i^{l1}x_j^{r0}x_j^{l0}x_i^{r1}x_i^{l1}$ with $\boxed{x_i^{l1}}\,\triangleleft\triangleleft$.

In case of $\psi_s(x_j) = \psi_s(x_{j+1}) = 1$, we may apply a similar argument. Notice that the equation $x_i \oplus x_j = 0$ is satisfied by $\psi_s$. In summary, we state that we obtain an overlap of one additional letter per satisfied equation. Hence, we obtain an overlap of three letters according to the satisfied equations $x_i \oplus x_{i+1} = 0$, $x_i \oplus x_j = 0$ and $x_j \oplus x_{j+1} = 0$.



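CASE (iii) $X_i \oplus X_j = 1$ and $X_i \neq X_{i+1}$:

Let us begin with the constellation $(10, 0X_{j+1})_{\{i,j\}}$. We consider the scenario depicted below, in which we are given the strings $\triangleright\triangleright\,\boxed{x_i^{r1}}$, $\boxed{x_i^{m0}}\,\triangleleft\triangleleft$, $\triangleright\triangleright\,\boxed{x_j^{r0}}$ and $\boxed{X_{j+1}}\,\triangleleft\triangleleft$ with $X_{j+1} \in \{x_j^{l0}, x_j^{m1}\}$.

[Figure: the superstring for the constellation $(10, 0X_{j+1})_{\{i,j\}}$ before and after the transformation.]
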

Instead of using the 1-alignment of the strings in $S(g^x_i)$, we rather switch to the 0-alignment, i.e. we obtain the string $\triangleright\triangleright\,\boxed{x_i^{m0}}$ and define $\psi_s(x_i) = 0$. It results directly in gaining two additional satisfied equations and an overlap of one additional letter. As a matter of fact, we might lose an overlap of one letter, because the string $\triangleright\triangleright\,\boxed{x_i^{r1}}$ might have been aligned from the right side with another string. Furthermore, the equation $x_{i-1} \oplus x_i = 0$ might be unsatisfied.

But all in all, we obtain at least $2 - 1$ additional satisfied equations by switching the value without increasing the length of the superstring. Notice that we may achieve an additional overlap of one letter if $X_{j+1} = x_j^{l0}$ holds, which means that $\psi_s$ satisfies the equation $x_j \oplus x_{j+1} = 0$.



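The next constellation, which we are going to analyze, is $(01, 1X_{j+1})_{\{i,j\}}$. Hence, we are given the strings $\triangleright\triangleright\,\boxed{x_i^{m0}}$, $\boxed{x_i^{l1}}\,\triangleleft\triangleleft$, $\triangleright\triangleright\,\boxed{x_j^{m1}}$ and $\boxed{X_{j+1}}\,\triangleleft\triangleleft$, with $X_{j+1} \in \{x_j^{l0}, x_j^{m1}\}$. The situation is displayed below.

[Figure: the superstring for the constellation $(01, 1X_{j+1})_{\{i,j\}}$ before and after the transformation.]
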

We obtain a similar situation, in which we switch $\triangleright\triangleright\,\boxed{x_i^{m0}}$ to $\triangleright\triangleright\,\boxed{x_i^{r1}}$. Accordingly, we define $\psi_s(x_i) = 1$. We obtain at least one additional satisfied equation by switching the value without increasing the length of the superstring. Notice that we may achieve an additional overlap of one letter if $X_{j+1} = x_j^{m1}$ holds. It corresponds to the satisfied equation $x_j \oplus x_{j+1} = 0$.



CASE (iv) $X_i \oplus X_j = 1$, $X_j \neq X_{j+1}$ and $X_i = X_{i+1}$:

Starting our analysis with the constellation $(00, 10)_{\{i,j\}}$, we obtain the following scenario.

[Figure: the superstring for the constellation $(00, 10)_{\{i,j\}}$ before and after the transformation.]


In this case, we argue that we switch the string $\triangleright\triangleright\,\boxed{x_j^{m1}}$ to $\triangleright\triangleright\,\boxed{x_j^{r0}}$. This means that we set $\psi_s(x_j) = 0$. This transformation yields an overlap of at least the same number of letters: we might lose an overlap of one letter from the left side, but on the other hand, we are able to align the string $\triangleright\triangleright\,\boxed{x_j^{r0}}$ from the right side by one letter with the aligned strings in $S(g^x_{\{i,j\}})$. Notice that we gain at least one additional satisfied equation.

The last constellation, which we are going to analyze, is $(11, 01)_{\{i,j\}}$. The corresponding situation is depicted below.

[Figure: the superstring for the constellation $(11, 01)_{\{i,j\}}$ before and after the transformation.]


In this case, we switch the string $\triangleright\triangleright\,\boxed{x_j^{r0}}$ to $\triangleright\triangleright\,\boxed{x_j^{m1}}$. Similarly to the former case, this transformation does not increase the length of the superstring. By defining $\psi_s(x_j) = 1$, we achieve at least one more satisfied equation.



In summary, we note that we achieve at least the same number of satisfied equations as the number of overlapped letters. After applying the defined transformations, the current superstring contains only strings corresponding to matching equations that use a simple alignment. Matching equations $x_i \oplus x_j = 0$ with $i > j$ can be analyzed analogously.

We are going to define the assignment for contact variables.



Defining the Assignment for Contact Variables 

Let $g^3_j \equiv x \oplus y \oplus z = 0$ be an equation with exactly three variables
in $\mathcal{H}$. Given a simple alignment of the strings corresponding to the equations
$x_{j_1-1} \oplus x = 0$, $x \oplus x_{j_1+1} = 0$, $y_{j_2-1} \oplus y = 0$,
$y \oplus y_{j_2+1} = 0$, $z_{j_3-1} \oplus z = 0$, and $z \oplus z_{j_3+1} = 0$, we are
going to define an assignment based on the underlying simple alignments and
analyze the number of satisfied equations in dependence of the number of
overlapped letters in the superstring.



For a given superstring $s$ for $S_{\mathcal{H}}$ and equation
$g^3_j \equiv x \oplus y \oplus z = 0$, we define a constellation $c$ given by
$(X_1X_2, Y_1Y_2, Z_1Z_2)^3_j$ with $X_1, X_2, Y_1, Y_2, Z_1, Z_2 \in \{0,1\}$,
where $C = 1$ with $C \in \{X_1, X_2, Y_1, Y_2, Z_1, Z_2\}$ if and only if the strings
in the corresponding set are using a 1-alignment in $s$. A constellation denotes
which of the simple alignments is used by the strings in $s$. We call a
constellation inconsistent if there is an entry $A_1A_2$ such that $A_1 \neq A_2$.
Otherwise, $c$ is called consistent.
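
The consistency condition can be made concrete with a small sketch. It assumes, for illustration only, that a constellation is stored as three pairs of bits, one pair per variable; the names `Constellation`, `is_consistent` and `inconsistent_entries` are hypothetical and not part of the construction.

```python
from typing import List, Tuple

# Hypothetical encoding: one (A1, A2) pair of bits per variable x, y, z.
Entry = Tuple[int, int]
Constellation = Tuple[Entry, Entry, Entry]


def is_consistent(c: Constellation) -> bool:
    """A constellation is consistent iff every entry A1A2 has A1 == A2."""
    return all(a1 == a2 for a1, a2 in c)


def inconsistent_entries(c: Constellation) -> List[int]:
    """Positions (0 for x, 1 for y, 2 for z) of the entries with A1 != A2."""
    return [i for i, (a1, a2) in enumerate(c) if a1 != a2]
```

For instance, the constellation written as (11,11,11) is consistent, whereas (01,00,11) is inconsistent in its first entry.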

Based on a constellation for a given superstring and an equation $g^3_j$ with
three variables, we are going to define the assignment $\psi_s$ for the variables
in $g^3_j$.

Definition 7 (Assignment $\psi_s$ to Contact Variables). Let $\mathcal{H}$ be an instance
of the Hybrid problem, $S_{\mathcal{H}}$ its corresponding instance of the superstring
problem, $s$ a superstring for $S_{\mathcal{H}}$ and $g^3_j \equiv x \oplus y \oplus z = 0$
an equation with three variables in $\mathcal{H}$. For the associated constellation
$c = (X_1X_2, Y_1Y_2, Z_1Z_2)^3_j$, we define $\psi_s$ in the following way.

(i) If $c$ is consistent, then we define $\psi_s(x) = X_1$, $\psi_s(y) = Y_1$ and
$\psi_s(z) = Z_1$.

(ii) Otherwise, let $A_1A_2$ be an entry in $c$ with $A_1 \neq A_2$ and $\alpha$ its
corresponding variable. Furthermore, let $\beta$ and $\gamma$ be the variables associated
with the entries $B_1B_2$ and $C_1C_2$, respectively. If
$A_1 \oplus B_1 \oplus C_1 = 0$ holds, we define $\psi_s(\alpha) = A_1$,
$\psi_s(\beta) = B_1$ and $\psi_s(\gamma) = C_1$.

(iii) Otherwise, we have $A_1 \oplus B_1 \oplus C_1 = 1$. Then, we define
$\psi_s(\alpha) = 1 - A_1$, $\psi_s(\beta) = B_1$ and $\psi_s(\gamma) = C_1$.
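
The three rules of Definition 7 can be sketched in a few lines. This is a minimal illustration under stated assumptions: a constellation is encoded as three pairs of bits, and when several entries are inconsistent the first one is taken as $\alpha$ (the definition leaves this choice open). The function name `contact_assignment` is hypothetical.

```python
from typing import Dict, Tuple

# Hypothetical encoding: one (A1, A2) pair of bits per variable x, y, z.
Entry = Tuple[int, int]
Constellation = Tuple[Entry, Entry, Entry]


def contact_assignment(c: Constellation) -> Dict[str, int]:
    """Assignment psi_s to x, y, z following rules (i)-(iii) of Definition 7."""
    names = ("x", "y", "z")
    # Start from the first bits X1, Y1, Z1; rules (i) and (ii) keep them as is.
    psi = {name: entry[0] for name, entry in zip(names, c)}
    inconsistent = [i for i, (a1, a2) in enumerate(c) if a1 != a2]
    if not inconsistent:
        return psi  # rule (i): c is consistent
    parity = c[0][0] ^ c[1][0] ^ c[2][0]
    if parity == 1:
        # Rule (iii): flip the variable alpha of an inconsistent entry.
        # Assumption: the first inconsistent entry is chosen as alpha.
        alpha = names[inconsistent[0]]
        psi[alpha] = 1 - psi[alpha]
    return psi  # rule (ii) if parity == 0, rule (iii) otherwise
```

Under this encoding, every inconsistent constellation yields an assignment with $\psi_s(x) \oplus \psi_s(y) \oplus \psi_s(z) = 0$, while a consistent constellation with $X_1 \oplus Y_1 \oplus Z_1 = 1$ leaves the equation with three variables unsatisfied.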

We are going to analyze the following three cases and define the
transformations for the actual superstring for $S_{\mathcal{H}}$.

(i) $X_1 \oplus Y_1 \oplus Z_1 = 1$ and $c$ is consistent

(ii) $X_1 \oplus Y_1 \oplus Z_1 = 0$ and $c$ is inconsistent

(iii) $X_1 \oplus Y_1 \oplus Z_1 = 1$ and $c$ is inconsistent

Let us begin with case (i).

CASE (i) $X_1 \oplus Y_1 \oplus Z_1 = 1$ and $c$ is consistent:

In this case, we start with the constellation $(11,11,11)^3_j$. We depict the
considered situation below.

[Figure: the strings of the constellation $(11,11,11)^3_j$ and their alignment.]

According to the definition of $\psi_s$, we have $\psi_s(x) = \psi_s(y) = \psi_s(z) = 1$.
Notice that the equation $x \oplus y \oplus z = 0$ is unsatisfied. On the other hand,
the assignment $\psi_s$ satisfies the equations $x \oplus x_{j_1+1} = 0$,
$y \oplus y_{j_2+1} = 0$ and $z \oplus z_{j_3+1} = 0$.
We note that a string corresponding to $S^A(g^3_j)$ or $S^B(g^3_j)$ using a simple
alignment can have an overlap of at most one letter from the right side as well
as from the left side. Therefore, the best we can hope for is to overlap the
string $y^{r1} A_j y^{l1}$ with $\triangleright\triangleright\, y^{r1}$ and
$y^{l1} \triangleleft\triangleleft$ by one letter in each case. The same holds for the
string $z^{r1} B_j z^{l1}$. Consequently, we conclude that the number of
overlapped letters is bounded from above by four.

In case of $X_1 + Y_1 + Z_1 = 1$, we analyze as an example the constellation
$(00,00,11)^3_j$. We set $\psi_s(z) = 1$, $\psi_s(x) = 0$ and $\psi_s(y) = 0$. This
situation is displayed below.

[Figure: the strings of the constellation $(00,00,11)^3_j$ and their alignment.]

Due to the $z^1$-alignment of the strings in $S^B(g^3_j)$, we obtain an overlap of
two letters. Additionally, we align the string $\triangleright\triangleright\, x^{m0}$
from the left with $x^{m0} \triangleleft\triangleleft$. The same holds for
$\triangleright\triangleright\, y^{m0}$ and $y^{m0} \triangleleft\triangleleft$. Notice that
it is not more advantageous to align the string $x^{m0} B_j C_j$ with
$\triangleright\triangleright\, x^{m0}$, since we lose the overlap of one letter with
$x^{m0} \triangleleft\triangleleft$. Hence, we are able to get an overlap of at most
four letters, which corresponds to the satisfied equations
$x \oplus x_{j_1+1} = 0$, $y \oplus y_{j_2+1} = 0$ and $z \oplus z_{j_3+1} = 0$.

CASE (ii) $X_1 \oplus Y_1 \oplus Z_1 = 0$ and $c$ is inconsistent:

First, we concentrate on the constellations with the property
$X_1 + Y_1 + Z_1 = 2$. As an example, we analyze the constellation
$(0X_2, 1Y_2, 1Z_2)^3_j$ depicted below.

[Figure: the strings of the constellation $(0X_2, 1Y_2, 1Z_2)^3_j$ and their alignment.]

The strings $\triangleright\triangleright\, y^{r1}$ and $\triangleright\triangleright\, z^{r1}$
can be used to align from the right side with $z^{r1} B_j z^{l1}$ and
$y^{r1} A_j y^{l1}$, respectively. It yields an overlap of two letters. If the
corresponding equations with two variables are satisfied, which means
$X_2 = x^{m0}$, $Y_2 = y^{l1}$ and $Z_2 = z^{l1}$, we gain an overlap of one letter
per satisfied equation. Notice that using the $x^0$-alignment of $S^B(g^3_j)$ does
not yield more overlapped letters. In summary, it is possible to attain an
overlap of at most five letters, which corresponds to the constellation
$(00,11,11)^3_j$. An analogous argument holds for the constellations
$(1X_2, 1Y_2, 0Z_2)^3_j$ and $(1X_2, 0Y_2, 1Z_2)^3_j$.



Next, we discuss constellations with the property $X_1 + Y_1 + Z_1 = 0$. For this
reason, we consider the constellation $(0X_2, 0Y_2, 0Z_2)^3_j$.

[Figure: the strings of the constellation $(0X_2, 0Y_2, 0Z_2)^3_j$ and their alignment.]

Recall that $x^{m0} C_j x^{m0}$ denotes the $x^0$-alignment of $S^B(g^3_j)$. This
string can be aligned from the left with $\triangleright\triangleright\, x^{m0}$. If
$X_2 = x^{m0}$ holds, we achieve another overlap of one letter. Furthermore, the
string $\triangleright\triangleright\, y^{m0}$ can be aligned from the right with
$Y_2 \triangleleft\triangleleft$ if and only if $Y_2 = y^{m0}$ holds. A similar
argument can be applied to the strings $\triangleright\triangleright\, z^{m0}$ and
$Z_2 \triangleleft\triangleleft$. Finally, we note that we cannot benefit by aligning
the string $y^{l1} \triangleleft\triangleleft$ with $y^{r1} A_j y^{l1}$. Consequently,
we see that using the string $x^{m0} C_j x^{m0}$ is generally more profitable. All
in all, we gain an additional overlap of one letter for satisfying
$x \oplus y \oplus z = 0$ and another overlap of one letter if the equation with two
variables corresponding to the considered variable is satisfied.



CASE (iii) $X_1 \oplus Y_1 \oplus Z_1 = 1$ and $c$ is inconsistent:

Let us start with constellations satisfying $X_1 + Y_1 + Z_1 = 3$. As an example,
we analyze the constellation $(10, 1Y_2, 1Z_2)^3_j$. Due to the definition of
$\psi_s$, we set $\psi_s(x) = 1 - X_1$, $\psi_s(y) = 1$ and $\psi_s(z) = 1$. Notice that
$\psi_s$ satisfies the equation $x \oplus y \oplus z = 0$. By switching the value
$\psi_s(x)$ from $X_1$ to $1 - X_1$, the equation $x_{j_1-1} \oplus x = 0$ might become
unsatisfied. Furthermore, we might lose an overlap of one letter by flipping
the 1-alignment of the strings corresponding to $x_{j_1-1} \oplus x = 0$ to the
0-alignment. On the other hand, we gain an overlap of one letter by aligning the
string $\triangleright\triangleright\, x^{m0}$ from the right side with
$x^{m0} \triangleleft\triangleleft$. This transformation yields at least one more
satisfied equation. In addition, the strings $y^{r1} A_j y^{l1}$ and
$z^{r1} B_j z^{l1}$ can be aligned by one letter with
$\triangleright\triangleright\, y^{r1}$ and $\triangleright\triangleright\, z^{r1}$,
respectively. If $Z_2 = z^{l1}$ and $Y_2 = y^{l1}$ hold, we achieve another overlap
of one letter in each case. The situation is depicted below.

[Figure: the strings of the constellation $(10, 1Y_2, 1Z_2)^3_j$ and their alignment.]



The other constellations satisfying $X_1 + Y_1 + Z_1 = 3$ can be analyzed
analogously.

The remaining constellations $(X_1X_2, Y_1Y_2, Z_1Z_2)^3_j$ to be discussed satisfy
$X_1 + Y_1 + Z_1 = 1$ and are inconsistent. As an example, we analyze the
constellation $(01, 0Y_2, 1Z_2)^3_j$. For $(01, 0Y_2, 1Z_2)^3_j$, we set
$\psi_s(x) = 1 - X_1$, $\psi_s(y) = Y_1$ and $\psi_s(z) = Z_1$. The scenario is depicted
below.

[Figure: the strings of the constellation $(01, 0Y_2, 1Z_2)^3_j$ and their alignment.]

By flipping the 0-alignment of the strings corresponding to
$x_{j_1-1} \oplus x = 0$ to the 1-alignment, we can overlap $x^{r1} A_j x^{l1}$ from
the left side with $\triangleright\triangleright\, x^{r1}$ and with
$x^{l1} \triangleleft\triangleleft$ from the right side. This transformation achieves
an overlap of at most one more letter. Moreover, we obtain at least one more
satisfied equation. If $Z_2 = z^{l1}$ and $Y_2 = y^{m0}$ hold, it yields an overlap
of three additional letters, which corresponds to the constellation
$(11, 00, 11)^3_j$.



In summary, we note that it is possible to achieve an overlap of at least one
letter in each case. In addition, the assignment $\psi_s$ yields at least the
same number of satisfied equations as the number of overlapped letters. This
means that if $\psi_s$ satisfies the equations $g^3_j$, $x \oplus x_{j_1+1} = 0$,
$y \oplus y_{j_2+1} = 0$ and $z \oplus z_{j_3+1} = 0$, the corresponding strings in $s$
can have an overlap of at most five letters.



5.5 Proof of Theorem 7

Given an instance $\mathcal{H}$ of the Hybrid problem with $n$ circles, $m_2$
equations with two variables and $m_3$ equations with exactly three variables
with the properties described in Theorem 5, we construct in polynomial time an
instance $S_{\mathcal{H}}$ of the Shortest Superstring problem with the properties
described in Section 5.2. Let $\phi$ be an assignment to the variables of
$\mathcal{H}$ which leaves at most $u$ equations unsatisfied. According to
Section 5.3, the length of the superstring $s_\phi$ is at most

\[ |s_\phi| \le 7 \cdot n + 5 \cdot m_2 + 22 \cdot m_3 + u, \]

since the length of the superstring increases by at most one letter for every
unsatisfied equation of the assignment. Regarding the compression measure, we
obtain the following.

\begin{align*}
\mathrm{comp}(S_{\mathcal{H}}, s_\phi) &\ge \sum_{s \in S_{\mathcal{H}}} |s| - (7 \cdot n + 5 \cdot m_2 + 22 \cdot m_3 + u) \\
&= (4 + 8) n + 8 \cdot m_2 + 36 \cdot m_3 - (7 \cdot n + 5 \cdot m_2 + 22 \cdot m_3 + u) \\
&= 5n + 3m_2 + 14m_3 - u
\end{align*}
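
The arithmetic behind the compression bound can be checked mechanically. The following sketch only restates the coefficients of the length bound and of the total input length from the derivation above; the function names and the sample instance sizes are hypothetical.

```python
def superstring_length_bound(n: int, m2: int, m3: int, u: int) -> int:
    """Upper bound 7n + 5*m2 + 22*m3 + u on the superstring length."""
    return 7 * n + 5 * m2 + 22 * m3 + u


def total_input_length(n: int, m2: int, m3: int) -> int:
    """Sum of the lengths of all strings: (4 + 8)n + 8*m2 + 36*m3."""
    return (4 + 8) * n + 8 * m2 + 36 * m3


def compression_bound(n: int, m2: int, m3: int, u: int) -> int:
    """Lower bound on the compression: total input length minus length bound."""
    return total_input_length(n, m2, m3) - superstring_length_bound(n, m2, m3, u)


# Hypothetical instance sizes, chosen only to exercise the formulas.
n, m2, m3, u = 10, 4, 3, 2
assert compression_bound(n, m2, m3, u) == 5 * n + 3 * m2 + 14 * m3 - u
```

The assertion confirms that the difference of the two expressions collapses to $5n + 3m_2 + 14m_3 - u$ for the chosen values.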

On the other hand, given a superstring $s$ for $S_{\mathcal{H}}$ with length

\[ |s| = 5m_2 + 22m_3 + u + 7n, \]

we can construct in polynomial time a normed superstring $s'$ without increasing
its length by applying the transformations defined in Section 5.4. This enables
us to define an assignment $\psi_{s'}$ to the variables of $\mathcal{H}$ according to
Section 5.4 that leaves at most $u$ equations in $\mathcal{H}$ unsatisfied. A similar
argument leads to the conclusion that given a superstring $s$ for
$S_{\mathcal{H}}$ with compression

\[ \mathrm{comp}(S_{\mathcal{H}}, s) = 5n + 3m_2 + 14m_3 - u, \]

we construct in polynomial time an assignment to the variables in $\mathcal{H}$ such
that at most $u$ equations are unsatisfied. □

Next, we are going to describe smaller gadgets for equations with three
variables implying an improved explicit lower bound and give the proof of
Theorem 6.

5.6 Proof of Theorem 6

Given an equation with three variables $g^3_j \equiv x \oplus y \oplus z = 0$, we
introduce the sets $S^A(g^3_j)$ and $S^B(g^3_j)$ including the following strings.

[Display: the strings of $S^A(g^3_j)$ and $S^B(g^3_j)$.]

In addition, we introduce new strings for the equation $x_{i-1} \oplus x = 0$. On the
other hand, the strings corresponding to $x \oplus x_{i+1} = 0$,
$y_{i-1} \oplus y = 0$, $y \oplus y_{i+1} = 0$, $z_{i-1} \oplus z = 0$ and
$z \oplus z_{i+1} = 0$ remain the same. Let us define the strings for
$x_{i-1} \oplus x = 0$:

[Display: the three strings for $x_{i-1} \oplus x = 0$.]

These three strings can each be aligned by two letters in a cyclic fashion.
Accordingly, we obtain three combinations that can be used to overlap with other
strings by one letter from the left side as well as from the right side. Note
that we have only two combinations if we consider only the leftmost position of
the combined strings. For example, the combination

[Display: the first combination of the three strings.]

can be used to overlap from the right side with strings in $S^B(g^3_j)$, whereas

[Display: a second combination of the three strings.]

can be aligned with strings contained in $S^A(g^3_j)$. Therefore, we may apply the
same arguments as in the proof of Theorem 6. The strings corresponding to
equations of the form $x \oplus y \oplus z = 1$ can be constructed analogously.
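
The cyclic alignment by two letters can be illustrated with a small sketch. It assumes, purely for illustration, three strings of length four whose letters follow the superscript pattern (l1, r1β, l1, r1α), (l1, r1α, m0, m0) and (m0, m0, l1, r1β); the token names are hypothetical, variable indices are omitted, and the helpers `overlap` and `merge` are not part of the construction.

```python
from typing import Sequence, Tuple

Word = Tuple[str, ...]


def overlap(a: Word, b: Word) -> int:
    """Length of the longest proper suffix of a that is also a prefix of b."""
    for k in range(min(len(a), len(b)) - 1, 0, -1):
        if a[-k:] == b[:k]:
            return k
    return 0


def merge(words: Sequence[Word]) -> Word:
    """Concatenate the words in order, sharing the overlap of neighbours."""
    out = list(words[0])
    for prev, cur in zip(words, words[1:]):
        out.extend(cur[overlap(prev, cur):])
    return tuple(out)


# Illustrative tokens only (assumption): "r1a" / "r1b" stand for r1-alpha / r1-beta.
s1 = ("l1", "r1b", "l1", "r1a")
s2 = ("l1", "r1a", "m0", "m0")
s3 = ("m0", "m0", "l1", "r1b")

# Each string overlaps its cyclic successor by exactly two letters.
assert overlap(s1, s2) == overlap(s2, s3) == overlap(s3, s1) == 2

# The three cyclic rotations give three combined strings of eight letters.
combos = [merge([s1, s2, s3]), merge([s2, s3, s1]), merge([s3, s1, s2])]
assert all(len(c) == 8 for c in combos)
```

Under these assumed tokens, the three combinations start with only two distinct leftmost letters, which matches the remark about the leftmost position.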

Given an instance $\mathcal{H}$ of the Hybrid problem with $n$ circles, $m_2$
equations with two variables and $m_3$ equations with exactly three variables
with the properties described in Theorem 5, we construct in polynomial time an
instance $S_{\mathcal{H}}$ of the Shortest Superstring problem. Let $\phi$ be an
assignment to the variables of $\mathcal{H}$ which leaves at most $u$ equations
unsatisfied. Then, it is possible to construct a superstring $s_\phi$ with length

\[ |s_\phi| \le 7 \cdot n + 5 \cdot m_2 + 16 \cdot m_3 + u, \]

since the length of the superstring increases by at most one letter for every
unsatisfied equation of the assignment. Regarding the compression measure, we
obtain the following.

\begin{align*}
\mathrm{comp}(S_{\mathcal{H}}, s_\phi) &\ge \sum_{s \in S_{\mathcal{H}}} |s| - (7 \cdot n + 5 \cdot m_2 + 16 \cdot m_3 + u) \\
&= (4 + 8) n + 8 \cdot m_2 + 28 \cdot m_3 - (7 \cdot n + 5 \cdot m_2 + 16 \cdot m_3 + u) \\
&= 5n + 3m_2 + 12m_3 - u
\end{align*}
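
For comparison, the coefficients of the two constructions can be set side by side. The display below only restates the numbers that appear in the length and compression bounds of Sections 5.5 and 5.6; the layout is an illustration and introduces no new quantities.

```latex
\begin{align*}
\text{Section 5.5:}\quad & \sum_{s \in S_{\mathcal{H}}} |s| = 12n + 8m_2 + 36m_3, &
  |s_\phi| &\le 7n + 5m_2 + 22m_3 + u, & 36 - 22 &= 14, \\
\text{Section 5.6:}\quad & \sum_{s \in S_{\mathcal{H}}} |s| = 12n + 8m_2 + 28m_3, &
  |s_\phi| &\le 7n + 5m_2 + 16m_3 + u, & 28 - 16 &= 12.
\end{align*}
```

Only the terms in $m_3$ change: the smaller gadgets reduce the length contribution per equation with three variables from 22 to 16 and the compression contribution from 14 to 12.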
On the other hand, given a superstring $s$ for $S_{\mathcal{H}}$ with length

\[ |s| = 5m_2 + 16m_3 + u + 7n, \]

we can construct in polynomial time a normed superstring $s'$ without increasing
its length. The corresponding assignment $\psi_{s'}$ to the variables of
$\mathcal{H}$ leaves at most $u$ equations in $\mathcal{H}$ unsatisfied. A similar
argument leads to the conclusion that given a superstring $s$ for
$S_{\mathcal{H}}$ with compression

\[ \mathrm{comp}(S_{\mathcal{H}}, s) = 5n + 3m_2 + 12m_3 - u, \]

we construct in polynomial time an assignment to the variables in $\mathcal{H}$ such
that at most $u$ equations are unsatisfied. □

6 Concluding Remarks



It seems that a new method is needed now in order to obtain better
approximation lower bounds. Perhaps direct PCP constructions are the natural
next step for proving stronger approximation hardness results for the problems
considered in this paper.